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MAIZE REPLICATION PROTEIN A 

FIELD OF THE INVENTION 
The invention relates to the genetic manipulation of plants, particularly to 
modulating DNA metabolism in transformed plants and plant cells. 

5 BACKGROUND OF THE INVENTION 

Replication protein A (RPA) is a single-stranded DNA-binding protein that 
is required for multiple processes in eukaryotic cells. RPA from human cells is a 
stable complex of 70-, 32-, and 14-kDa subunits. Homologues of RPA have been 
identified in all eukaryotes examined. However, only human RPA and closely 

10 related homologues can support SV40 DNA replication. 

The RPA complex appears to be highly conserved in all eukaryotes. The 
three RPA genes in budding yeast cells are essential for cell viability. 
Nevertheless, yeast RPA only partially substitutes for human RPA in the in vitro 
replication of simian virus 40 indicating that species-specific interactions between 

15 RPA and other replication proteins may be important for its biological activity. 

RPA binds tightly to single stranded DNA as a heterotrimeric complex. 
The binding activity has been localized to the 70 kDa subunit. The affinity of RPA 
for both double-stranded DNA and RNA is at least three orders of magnitude lower 
than it is for single-stranded DNA. It has been reported that RPA binds 

20 preferentially to the pyrimidine-rich strand of both 5. cerevisiae sequences and the 
SV40 origin of replication. However, studies examining the determinants of 
replication origins in 5. cerevisiae indicate that this preferential binding is not 
critical for the initiation of 
DNA replication. 

25 Subunits of RPA in the 70-, 32- and 14 kDa ranges have been identified 

from various sources. The 32kDa subunit has also been referred to as "RPA2", 
"B'\ "small", "32kDa", "P32", "P34", and "middle" subunit. For the purposes of 
this invention, the "middle" subunit is intended as the subunit having a molecular 
weight of about 32 kDa. 

30 The middle subunit of RPA has a role in cell cycle regulation; single 

stranded DNA binding; affinity of DNA binding; species-specificity of DNA 
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binding, DNA recombination, repair, replication and metabolism; and response to 
DNA damages. (Anderson (1966) Calif. Inst. TechnoL \ Seroussi et al (1993) J. 
Biol Chem. 265:7147-54, Kenny et al. (1989) Proc. Natl. Acad Sci. USA 56:9757- 
61; Brush etal. (1995) Methods Enzymol. 262:522-48; Stiggere/a/. (1994) Proc. 
5 Natl. Acad Sci. USA 97:579-83, Philipova et al. (1996) Genes Dev. 70:2222-33). 
Much research has centered on the exploration of the biochemical and 
genetic mechanisms by which cell cycle regulation of DNA synthesis is achieved. 
While there have been advances in delineating the existence of cell cycle proteins, 
more information is needed on the mechanism of action of DNA replication, 
10 recombination, and repair. Furthermore, methods for regulating or altering the cell 
cycle is needed. 

Related Literature 

Braun et al (1997) Biochemistry 56:8443-8454, report on the role of 
1 5 protein-protein interactions and the function of replication protein A. It is reported 

that RPA modulates the activity of DNA polymerase a by multiple mechanisms. 
Loor et al (1997) Nucleic Acids Research 25:5041-5046 report on the 

identification of DNA replication in cell cycle proteins that interact with 

proliferating cell nuclear antigen. 
20 Longhese et al (1994) Molecular and Cellular Biology 77:7884-7890 

report that replication factor A is required for in vivo DNA replication, repair, and 

recombination. 

Stigger et al (1998) J. Biol Chem. 273:9337-9343 provide a functional 

analysis of human replication protein A in nucleotide excision repair. 
25 Abremova et al (1997) Proc. Natl Acad ScL USA 94:7186-7191 report 

that the interaction between replication protein A and p53 is disrupted after 

ultraviolet damage in a DNA repair-dependent manner. 

New et al (1998) Nature 397:407-410 reports that RAD52 protein 

stimulates DNA strand exchange by RAD51 and replication protein A. Stimulation 
30 was dependent on the concerted action of both RAD 5 1 protein and RPA implying 

that specific protein-protein interactions between RAD52 protein, RAD5 1 protein 

and RPA are required. 
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Duttae/fl/. (1992)£Mfi07 / 7^:21 89-2 199 and Niu e/ a/. (1997)7. Biol 
Chem. 272(19) : 12634-41 report cell cycle-dependent phosphorylation of the 
middle subunit of RPA, implying a role for the subunit in cell cycle regulation. 
Bochkareva et al (1998) J. Biol Chem. 2 75(7;: 3 93 2-3 93 6 report the 
5 formation of a single stranded DNA binding site on the human RPA middle 
subunit. 

Masse/ a/. (1998)Mo/. Cell Biol 18(1 1)\ 6399-6407 report that the RPA 
middle subunit contacts nascent simian virus 40 DNA, particularly the early DNA 
chain intermediates synthesized by DNA polymerase alpha-primase (RNA-DNA 
10 primers), but not more advanced products. 

Lavrik ei al. (1998) Nucleic Acids Res 26(2) :602-601 report on location of 
binding of individual subunits of human RPA to DNA primer-template complexes 
in various elongation reactions. 

Sibenaller et al. (1998) 37(36): 12496-12506 report that differences in the 
15 activity of the middle (32kDa) and the small (14 Kda) subunits of RPA are 
responsible for variations in the single stranded DNA-binding properties of 
sacchromyces cerevisiae and human RPA, thus implying a role for the subunits in 
species-specificity of DNA binding of RPA. 

20 SUMMARY OF THE INVENTION 

Compositions and methods for modulating DNA metabolism in a host cell 
is provided. Particularly, the complete cDNA and ammo acid sequence for 
homologues of maize replication protein A (RPA) large- and middle subunits are 
provided. The sequences of the invention find use in modulating DNA replication, 

25 DNA repair, and recombination. 

Transformed plants can be obtained having altered metabolic states. The 
invention has implications in genetic transformation and gene targeting in plants. 
Additionally, the methods can be used to promote cell death particularly in an 
inducible or tissue-preferred manner. 

30 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure I provides a comparison of eukaryotic RPA large subunit amino 
acid sequences. Amino acid sequences for the RPA large subunits from 
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Sacchromyces Cerevisiae (Rfal Yeast, SEQ ID NO: 10), Schizosacchromyces 
pombe (Rfal_Schpo, SEQ ID NO: 9), Drosophila melanogaster (RfalJDrome, 
SEQ ID NO: 8;, Homo sapiens (Rfal_Human, SEQ ID NO 7), Xenopus laevis 
(Rfa_Xenia, SEQ ID NO: 6), and Oryza sativa (024183, SEQ ID NO:5) were 
5 compared with the maize RPA LS homologue 1 (ZMRPALSH1, SEQ ID NO:2) 
and homologue 2 (ZMRPALSH2, SEQ ID NO:4) using the GCG PileUp program 
utilizing default parameters. The putative zinc finger region is shown in italics. 

Figure 2 provides an expression construct for inducible expression of the 
10 maize RPA large or middle subunit antisense construct. 

DETAILED DESCRIPTION OF THE INVENTION 
Nucleotide sequences and proteins useful for modulating DNA metabolism 
are provided. The nucleotide and amino acid sequences correspond to the maize 

1 5 replication protein A (RPA) subunits. RPA is a single-stranded DN A-binding 

protein that is required for multiple processes in DNA metabolism, including DNA 
replication, DNA repair, and recombination. The RPA complex generally 
comprises subunits of approximately 70, 32, and 14 kDa. By "large subunit", 
"middle subunit 1 ', and "small subunit" is herein intended a RPA subunit having the 

20 approximate molecular weight of 70-, 32-, and 14 kDa respectively . The 

sequences of the invention comprise the large- and middle subunits of the RPA 
complex. The sequences of the invention additionally find use in modulating gene 
expression. 

Compositions of the invention include RPA nucleotide and amino acid 
25 sequences that are involved in modulating DNA metabolism. In particular, the 
present invention provides for isolated nucleic acid molecules comprising 
nucleotide sequences encoding the amino acid sequences shown in SEQ ID NOs:2 
and 4 for the large subunit, and SEQ ID NOs: 12, 14, 16, 18, 20, and 22 for the 
middle subunit. SEQ ID NO:2 and SEQ ID NO:4 correspond to the amino acid 
30 sequences for the maize RPA large subunit homologue 1 (ZmRPALSHl) and 

homologue 2 (ZmRPALSH2). SEQ ID NOs: 12, 14, 16, 18, 20, and 22 correspond 
to the amino acid sequences for the maize middle subunit homologue 1 
(ZmRPAMSHl); homologues 2 and 3 (ZmRPAMSH2 and ZmRPAMSH3); 

4 
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homologue 4 (ZmRPAMSH4); homologue 5 (ZmRPAMSHS); homologue 6 
(ZmRP AMSH6); and homologue 7 (ZmRPAMSH7) respectively. 

For the large subunit, the present invention alternatively provides the 
nucleotide sequences encoding the DNA sequences deposited in a bacterial host as 

5 Patent Deposit Nos: 98754 and 98843. For the large subunits, further are 

polypeptides having an amino acid sequence encoded by a nucleic acid molecule 
described herein, for example those set forth in SEQ ID NOs: 1 and 3, those 
deposited in a bacterial host as Patent Deposit Nos: 98754 and 98843, and 
fragments and variants thereof. 

10 Plasmids containing the RPA large subunit nucleotide sequences of the 

invention were deposited with the Patent Depository of the American Type 
Culture Collection (ATCC), Manassas, Virginia, and assigned Patent Deposit NOs: 
98754 and 98843 These deposits will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the Deposit of 

15 Microorganisms for the Purposes of Patent Procedure. These deposits were made 
merely as a convenience for those of skill in the art and are not an admission that a 
deposit is required under 35 U.S.C. §1 12. 

Nucleotide sequences encoding the amino acid sequences for the maize 
RPA large subunit homologue 1 (ZmRPALSHl) and homologue 2 (ZmRPALSH2) 

20 are set forth in SEQ ID NOs 1 and 3. Nucleotide sequences encoding the amino 
acid sequences for the maize RPA middle subunit homologue 1 (ZmRPAMSHl), 
homologues 2 and 3 (ZmRP AMSH2 and ZmRPAMSH3); homologue 4 
(ZmRPAMSH4); homologue 5 (ZmRPAMSHS); homologue 6 (ZmRPAMSH6); 
and homologue 7 (ZmRPAMSH7) are set forth in SEQ ID NOs: 1 1, 13, 15, 17, 19, 

25 and 21 respectively. 

The invention encompasses isolated or substantially purified nucleic acid or 
protein compositions. An "isolated" or "purified" nucleic acid molecule or protein, 
or biologically active portion thereof, is substantially free of other cellular material, 
or culture medium when produced by recombinant techniques, or substantially free 

30 of chemical precursors or other chemicals when chemically synthesized. 

Preferably, an "isolated" nucleic acid is free of sequences (preferably protein 
encoding sequences) that naturally flank the nucleic acid (i.e., sequences located at 
the 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from 
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which the nucleic acid is derived. For example, in various embodiments, the 
isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 
kb, 0.5 kb, or 0. 1 kb of nucleotide sequences that naturally flank the nucleic acid 
molecule in genomic DNA of the cell from which the nucleic acid is derived. A 
5 protein that is substantially free of cellular material includes preparations of protein 
having less than about 30%, 20%, 10%, 5%, (by dry weight) of contaminating 
protein. When the protein of the invention or biologically active portion thereof is 
recombinantly produced, preferably culture medium represents less than about 
30%, 20%, 10%, or 5% (by dry weight) of chemical precursors or non-protein-of- 
10 interest chemicals. 

RPA binds tightly to single-stranded DNA (ssDNA). The affinity of 
binding to double-stranded DNA (dsDNA) is three to four orders of magnitude 
lower than the binding affinity for ssDNA. Because RPA has been found to bind 
specifically to certain dsDNA sequences that seem to be involved in the regulation 
15 of transcription, modulation of gene expression may be affected by an increase or 
decrease in RPA expression in the host cell. 

RPA has a wide range of activity and therefore uses relating to DNA 
metabolism and cell cycle. RPA interacts specifically with several proteins 
required for nucleotide excision repair. Interactions with repair proteins indicate 
20 that RPA may be important for efficient damage recognition and cleavage. RPA 
additionally interacts with RAD52 protein, a protein that is essential for dsDNA- 
break repair. This interaction appears to be essential for homologous 
recombination. In this manner, expression of the nucleotides of the invention may 
promote homologous recombination by recruiting factors which are essential for 
25 recombination to occur. Thus, the methods and compositions of the invention find 
use in promoting homologous recombination. 

In one embodiment, genetic manipulation by homologous recombination 
can be improved by either expression of the RPA coding sequences of the 
invention during transformation, or by providing RPA protein. RPA protein, for 
30 example, may be provided as a coating to particles during particle bombardment. 
Alternatively, DNA constructs providing for the expression of RPA may be 
included with the DNA to be transformed. The increase in RPA during 
transformation, particularly integration of polynucleotides by homologous 
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recombination, promotes integration and insertion of the DNA sequences of 
interest into the plant genome. 

In the same manner, it may be beneficial to inhibit the expression or 
presence of the RPA protein to encourage non-specific recombination events. In 
5 this manner, antibodies, peptides, antisense oligonucleotides and the like may be 
utilized to inhibit the activity of RPA. Alternatively, antisense constructs may be 
provided to inhibit the expression of RPA and encourage non-specific 
recombination. 

Catalytic RNA molecules or ribozymes can also be used to inhibit 
10 expression of plant genes. It is possible to design ribozymes that specifically pair 
with virtually any target RNA and cleave the phosphodiester backbone at a specific 
location, thereby functionally inactivating the target RNA. In carrying out this 
cleavage, the ribozyme is not itself altered, and is thus capable of recycling and 
cleaving other molecules, making it a true enzyme. The inclusion of ribozyme 
1 5 sequences within antisense RN As confers RN A-cleaving activity upon them, 
thereby increasing the activity of the constructs. The design and use of target 
RNA-specific ribozymes is described in Haseloff et al. (1988) Nature 
534:585-591. 

A variety of cross-linking agents, alkylating agents and radical generating 
20 species as pendant groups on polynucleotides of the present invention can be used 
to bind, label, detect, and/or cleave nucleic acids. For example, Vlassov, V V. et 
al. (1986) Nucleic Acids Res. 74:4065-4076, describe covalent bonding of a single- 
stranded DNA fragment with alkylating derivatives of nucleotides complementary 
to target sequences. A report of similar work by the same group is that by Knorre 
25 et al. (1985) Biochimie 67:785-789. Iverson and Dervan also showed 

sequence-specific cleavage of single-stranded DNA mediated by incorporation of a 
modified nucleotide which was capable of activating cleavage (1987) J. Am. Chem. 
Soc. 709:1241-1243). Meyer et al. (1989) J. Am. Chem. Soc. 777:8517-8519, 
effect covalent crossiinking to a target nucleotide using an alkylating agent 
30 complementary to the single-stranded target nucleotide sequence. A 

photoactivated crossiinking to single-stranded oligonucleotides mediated by 
psoralen was disclosed by Lee et al. (1988) Biochem. 27:3 197-3203. Use of 
crossiinking in triple-helix forming probes was also disclosed by Home et al. 
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" (1990) J. Am. Chem. Soc. 112:2435-2437. Use of N4, N4-ethanocytosine as an 
alkylating agent to crosslink to single-stranded oligonucleotides has also been 
described by Webb et al. (1986) J. Am. Chem. Soc. 70£:2764-2765, Webb et al. 
(1986) Nucleic Acids Res. 74:7661-7674; Feteritz et al. (1991) J. Am. Chem. Soc. 
5 773:4000 Various compounds to bind, detect, label, and/or cleave nucleic acids 
are known in the an. See, for example, U.S. Patent Nos. 5,543,507; 5,672,593; 
5,484,908, 5,256,648, and 5,681,941. 

RPA is required for the replication of chromosomal DNA. Inhibition of 
endogenous RPA expression is deleterious to the cell, organism, or plant. Thus, 

10 the constructs of the invention can be used to selectively kill target cells or tissues. 
This can be accomplished through the use of inducible or tissue-preferred 
promoters. In this manner, the sequences of the invention may find use in 
enhancing pathogen resistance. An antisense construct for the RPA coding 
sequence is operably linked to a pathogen-inducible promoter. Upon contact with 

15 the pathogen, the RPA antisense construct is expressed resulting in cell death and 
effectively preventing the invasion of the pathogen. 

The invention is drawn to compositions and methods for inducing 
resistance in a plant to plant pests. Accordingly, the compositions and methods are 
also useful in protecting plants against fungal pathogens, viruses, nematodes, 

20 insects and the like. 

By "disease resistance" is intended that the plants avoid the disease 
symptoms that are the outcome of plant-pathogen interactions. That is, pathogens 
are prevented from causing plant diseases and the associated disease symptoms, or 
alternatively, the disease symptoms caused by the pathogen is minimized or 

25 lessened. The methods of the invention can be utilized to protect plants from 
disease, particularly those diseases that are caused by plant pathogens. 

Pathogens of the invention include, but are not limited to, viruses or 
viroids, bacteria, insects, nematodes, fungi, and the like. Viruses include any plant 
virus, for example, tobacco or cucumber mosaic virus, ringspot virus, necrosis 

30 virus, maize dwarf mosaic virus, etc. Specific fungal and viral pathogens for the 
major crops include: Soybeans : Phytophthora megasperma fsp. glycinea, 
Macrophomma phaseolma, Rhizoctonia solam, Sclerotinia sclerotiorum, Fusarium 
oxysporum, Diaporthe phaseolorum var. sojae (Phomopsis sojae), Diaporthe 
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phaseolomm var. caulivora, Scleronum rolfsii, Cercospora kikuchii, Cercospora 
sojina, Peronospora manshuhca, Colletotnchum dematium (Colletotichum 
truncation), Corynespora cassucola, Septona glycines, Phyllosticta sojicola, 
Alternana alternata, Pseudomonas syringae p. v. glycinea, Xanlhomonas 
5 campestns p. v. phaseoli, Microsphaera diffusa, Fusarium semi tectum, 

Phialophora gregata, Soybean mosaic virus, Glomerella glycines^ Tobacco Ring 
spot virus, Tobacco Streak virus, Phakopsora pachyrhizi, Pythtum 
aphanidermatum, Pythium ultimum, Pythnim debaryanum, Tomato spotted wilt 
virus, Heterodera glycines Fusarium solani, Canola : Albugo Candida, Alternana 

10 brassicae, Leptosphaeria maculans, Rhizoctonia solani, Sclerotmia sclerotiorum, 
Mycosphaerella brassiccola, Pythium ultimum, Peronospora parasitica, Fusarium 
roseum, Alternana alternata; Alfalfa : Clavibater michiganese subsp. insidiosum, 
Pythium ultimum, Pythium irregulare, Pythium splendens, Pythium debaryanum, 
Pythnim aphamdermatum , Phytophthora megasperma, Peronospora trifoliorum, 

1 5 Phoma medicagtnis var. medicaginis, Cercospora medicaginis, Pseudopeziza 

medicagims, Leptotrochila medicaginis, Fusarium, Xanlhomonas campestris p. v. 
alfalfae, Aphanomyces euteiches, Stemphylium herbarum, Stemphylium alfalfae\ 
Wheat : Pseudomonas syringae p. v. atrofaciens, Urocystis agropyri, Xanlhomonas 
campestris p. v. translucens, Pseudomonas syringae p.v. syringae, Alternana 

20 alternata, Cladosponum herbarum, Fusarium gramtnearum, Fusarium 
avenaceum, Fusarium culmorum, Ustilago tntici, Ascochyta trinci, 
Cephalosponum gramineum. Collotetnchum graminicola, Erysiphe graminis f.sp. 
tntici, Puccinia graminis f.sp. tntici, Puccinia recondita f.sp. tntici, Puccinia 
stniformis, Pyrenophora tntici-repentis, Septona nodorum, Septona thtici, 

25 Septona avenae, Pseudocercosporella herpothchoides, Rhizoctonia solani, 
Rhizoctonia cerealis, Gaeumannomyces graminis var. trilici, Pythium 
aphanidermatum, Pythium arrhenomanes, Pythium ultimum, Bipolaris 
sorokiniana, Barley Yellow Dwarf Virus, Brome Mosaic Virus, Soil Borne Wheat 
Mosaic Virus, Wheat Streak Mosaic Virus, Wheat Spindle Streak Virus, American 

30 Wheat Striate Virus, Claviceps purpurea, Tilletia tntici, Tilletia laevis, Ustilago 
thtici, Tilletia indica, Rhizoctonia solani, Pythium arrhenomannes, Pythium 
gramicola, Pythium aphanidermatum, High Plains Virus, European wheat striate 
virus; Sunflower : Plasmophora halstedii, Scleroiinia sclerotiorum, Aster Yellows, 



WO 00/15816 



PCT/US99/21277 



- Septoha helianthi, Phomopsis helianthi, Alternaria helianthi, Alternaria ztnniae y 
Botrytis cinerea, Phoma macdonaldii, Macrophomina phaseolina, Erysiphe 
cichoracearum, Rhizopus oryzae, Rhizopus arrhizus, Rhizopus stolonifer, Puccinia 
helianthi, Verticillium dahliae, Erwinia carotovorum pv. carotovora, 
5 Cephalospormm acremonium, Phytophthora cryptogea, Albugo tragopogonis] 
Corn : Fusarmm moniliforme var. subglutinans, Erwinia stewartii, Fusarium 
moniliforme, Gibberella zeae (Fusarium gramimarum) , Stenocarpella maydi 
(Diplodia maydis), Pythium irregulare, Pythmm debaryanum, Pythium 
graminicola, Pythium splendens, Pythium ultimum, Pythium aphanidermatum, 

10 Aspergillus flavus, Bipolarts maydis O, T (Cochliobolus heterostrophus) , 

Helminthosporium carbonum I, II & III (Cochliobolus carbonum), Exserohilum 
turacum I, II & III, Helminthosporium pedicellatum, Physoderma maydis, 
Phyllosticta maydis, Kabatiella maydis^ Cercospora sorghi, Ustilago maydis, 
Puccinia sorghi, Puccinia polysora, Macrophomina phaseolina, Penicillium 

15 oxalicum, Nigrospora oryzae, Cladosporium herbarum, Cunndaria lunata, 

Curvularia inaequalis, Curvularia pallescens, Clavibacter michiganense subsp. 
nebraskense, Trichoderma vinde, Maize Dwarf Mosaic Virus A & B, Wheat 
Streak Mosaic Virus, Maize Chiorotic Dwarf Virus, Claviceps sorghi, 
Pseudonomas avenae, Erwinia chrysanthemi pv. zea, Erwinia carotovora, Corn 

20 stunt spiroplasma, Diplodia macrospora, Sclerophthora macrospora, 

Peronosclerospora sorghi, Peronosclerospora philippinensis, Peronosclerospora 
maydis, Peronosclerospora sacchari, Sphacelotheca reiliana, Physopella zeae, 
Cephalospormm maydis, Cephalospormm acremonium, Maize Chiorotic Mottle 
Virus, High Plains Virus, Maize Mosaic Virus, Maize Rayado Fino Virus, Maize 

25 Streak Virus, Maize Stripe Virus, Maize Rough Dwarf Virus; Sorghum : 

Exserohilum turcicum, Colletotrichum graminicola (Glome re I la graminicola) , 
Cercospora sorghi, Gloeocercospora sorghi, Ascochyta sorghina, Pseudomonas 
syringae p. v. syringae, Xanthomonas campestris p. v. ho lei cola, Pseudomonas 
andropogonis, Puccinia purpurea, Macrophomina phaseolina, Perconia circinata, 

30 Fusarium moniliforme, Alternaria alternata, Bipolar is sorghicola, 

Helminthosporium sorghicola, Curvularia lunata, Phoma insidiosa, Pseudomonas 
avenae (Pseudomonas alboprecipitans) , Ramulispora sorghi, Ramulispora 
sorghicola, Phyllachara sacchari, Sporisorium reilianum (Sphacelotheca reiliana), 
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Sphacelotheca cnienta, Spohsorium sorghi, Sugarcane mosaic H, Maize Dwarf 
Mosaic Virus A & B, Claviceps sorghi, Rhizoctonia solani, Acremomum strictum, 
Sclerophihona macrospora, Peronosclerospora sorghi^ Peronosclerospora 
philippinensis, Sclerospora gramimcola, Fusarium grammearum. Fusarium 
5 oxysporum, Pythium arrhenomanes, Pythium graminicola, etc. 

Nemaiodes inciude parasitic nematodes such as root-knot, cyst, and lesion 
nematodes, including Heterodera and Globodera spp; particularly Globodera 
rostochiensis and globodera pallida (potato cyst nematodes); Heterodera glycines 
(soybean cyst nematode); Heterodera schachtii (beet cyst nematode); and 

10 Heterodera avenae (cereal cyst nematode). 

Insect pests include insects selected from the orders Coleoptera, Diptera, 
Hymenoptera, Lepidoptera, Mallophaga, Homoptera, Hemiptera, Orthoptera, 
Thysanoptera, Dermaptera, Isoptera, Anoplura, Siphonaptera, Trichoptera, etc., 
particularly Coleoptera and Lepidoptera. Insect pests of the invention for the 

15 major crops include: Maize : Ostrima nubilalis, European corn borer, Agrotis 
ipsilon, black cutworm; Helicoverpa zea, corn earworm; Spodoptera frugiperda, 
fall armyworm; Diatraea grandiosella, southwestern corn borer; Elasmopalpus 
lignosellus, lesser cornstalk borer; Diatraea saccharalis, surgarcane borer; 
Diabrotica virgifera, western corn rootworm, Diabrouca longicornis barber i, 

20 northern corn rootworm, Diabrotica undecimpunctata howardi, southern corn 
rootworm; Melanotus spp., wireworms, Cyclocephala borealis, northern masked 
chafer (white grub); Cyclocephala immaculata, southern masked chafer (white 
grub), Popillia japonica, Japanese beetle; Chaetocnema pulicana, corn flea beetle; 
Sphenophorus maidis, maize billbug; Rhopalosiphum maidis, corn leaf aphid; 

25 Anuraphis maidiradicis, corn root aphid; Blissus leucopterus leucopierus > chinch 
bug; Melanoplus femurrubrum, redlegged grasshopper; Melanoplus sanguinipes, 
migratory grasshopper; Hylemya platura, seedcom maggot; Agromyza parvicornis, 
corn blot leafminer; Anaphothhps obscrurus, grass thrips, Solenopsis milesta, thief 
ant; Tetranychus urticae, twospotted spider mite; Sor g hum : Chilo partellus, 

30 sorghum borer; Spodoptera frugiperda, fall armyworm; Helicoverpa zea, corn 
earworm, Elasmopalpus lignosellus, lesser cornstalk borer; Feltia subterranea, 
granulate cutworm; Phyllophaga crinita, white grub, Eleodes, Conoderus, and 
Aeolus spp., wireworms; Oulema melanopus, cereal leaf beetle; Chaetocnema 
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* pulicaria, corn flea beetle; Sphenophorus maidis, maize billbug; Rhopalosiphum 
maidis, corn leaf aphid; Sipha flava, yellow sugarcane aphid; Blissus leucopterus 
leucopterus, chinch bug; Contarinia sorghicola, sorghum midge; Tetranychus 
cinnabarinus, carmine spider mite; Tetranychus urticae, twospotted spider mite; 
5 Wheat : Pseudaletia umpunctata, army worm; Spodoptera frugiperda, fall 

armyworm; Elasmopalpus lignosellus, lesser cornstalk borer; Agrotis orthogonia, 
western cutworm; Elasmopalpus lignosellus, lesser cornstalk borer; Oulema 
melanopus, cereal leaf beetle; Hypera punctata, clover leaf weevil; Diabrotica 
undecimpunctata howardi, southern corn rootworm; Russian wheat aphid; 

10 Schizaphis graminum, greenbug, Macrosiphum avenae, English grain aphid, 
Melanoplus femurrubrum, redlegged grasshopper; Melanoplus differentialis, 
differential grasshopper; Melanoplus sanguinipes, migratory grasshopper; 
Mayetiola destructor, Hessian fly; Sitodiplosis mosellana, wheat midge; Meromyza 
americana, wheat stem maggot; Hylemya coarctata, wheat bulb fly; Frankliniella 

15 fusca, tobacco thrips; Cephus cinctus, wheat stem sawfly; Aceria tulipae, wheat 
curl mite; Sunflower : Suleima helianthana, sunflower bud moth, Homoeosoma 
electellum, sunflower moth; zygogramma exclamationis, sunflower beetle; 
Bothyrus gibbosus, carrot beetle; Neolasioptera murtfeldtiana, sunflower seed 
midge; Cotton : Heliothis virescens, cotton budworm; Helicoverpa zea, cotton 

20 bollworm; Spodoptera exigua, beet armyworm; Pectinophora gossypiella, pink 

bollworm; Anthonomus grandis grandis, boll weevil; Aphis gossypn, cotton aphid; 
Pseudatomoscelis seriaius, cotton fleahopper; Trialeurodes abutilonea, 
bandedwinged whitefly; Lygus lineolaris, tarnished plant bug; Melanoplus 
femurrubrum, redlegged grasshopper; Melanoplus differentialis, differential 

25 grasshopper; Thrips tabaci, onion thrips; Franklinkiella fusca, tobacco thrips; 

Tetranychus cinnabarinus, carmine spider mite; Tetranychus urticae, twospotted 
spider mite, Rice : Diatraea saccharalis, sugarcane borer; Spodoptera frugiperda, 
fall armyworm, Helicoverpa zea, corn earworm; Colaspis brunnea, grape colaspis; 
Lissorhoptrus oryzophilus, rice water weevil; Sitophilus oryzae, rice weevil; 

30 Nephotettix nigropictus, rice leafhopper; Blissus leucopterus leucopterus, chinch 
bug; Acrosternum hilar e, green stink bug, Soybean : Pseudoplusia includens, 
soybean looper; Anticarsia gemmatalis, velvetbean caterpillar; Plathypena scabra, 
green cloverworm; Ostrinia nubilalis, European corn borer; Agrotis ipsilon, black 
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cutworm, Spodoptera exigua, beet armyworm; Hehothis virescens, cotton 
budworm; Helicoverpa zea, cotton bollworm; Epilachna varivestis, Mexican bean 
beetle; Myzus persicae, green peach aphid; Empoasca fabae, potato leafhopper; 
Acrostermim hilare, green stink bug; Melanoplus femurrubrum, redlegged 
5 grasshopper; Melanoplas differential , differential grasshopper; Hylemya platura, 
seedcorn maggot; Sericothrips variabilis, soybean thrips, Thnps tabaci, onion 
thrips; Tetranychus turkestani, strawberry spider mite, Tetranychus urticae, 
twospotted spider mite, Barley: Ostnnia nubilalis, European corn borer; Agrotis 
ipsilon, black cutworm; Schizaphis graminum, greenbug; Blissus leucopterus 

10 leucopterus, chinch bug, Acrostermim hilare, green stink bug, Euschistus servus, 
brown stink bug; Delia platura, seedcorn maggot, Mayetiola destructor, Hessian 
fly; Petrohia latens, brown wheat mite, Oil Seed Rape Brevicoryne brassicae, 
cabbage aphid; Phyllotreia cruaferae, Flea beetle; Mamestra configurata, Benha 
armyworm, Plutella xylostella, Diamond-back moth; Delia ssp., Root maggots. 

1 5 A number of promoters can be used in the practice of the invention. The 

promoters can be selected based on the desired outcome. The nucleic acids can be 
combined with constitutive, tissue-preferred, or other promoters for expression in 
plants 

A plant promoter can be employed which will direct expression of a 
20 polynucleotide of the present invention in all tissues of a regenerated plant. Such 
promoters are referred to herein as "constitutive" promoters and are active under 
most environmental conditions and states of development or cell differentiation. 
Such constitutive promoters include, for example, the core promoter of the Rsyn7 
(WO 99/43838); the core CaMV 35S promoter (Odell et ai (1985) Nature 
25 575:810-812); rice actin (McElroy et ai (1990) Plant Cell 2: 163-171); ubiquitin 
(Christensen et ai (1989) Plant MoL Biol. 72:619-632 and Christensen et ai 
(1992) Plant MoL Biol. /S:675-689), pEMU (Last et ai (1991) Theor. Appl 
Genet 57:581-588); MAS (Veitene/a/. ( 1 984) EiVlBO J. 5:2723-2730), ALS 
promoter (U.S. Patent No. 5,659,026), and the like. Other constitutive promoters 
30 include, for example, U.S. Patent Nos. 5,608,149; 5,608,144, 5,604,121; 
5,569,597; 5,466,785, 5,399,680; 5,268,463, and 5,608,142. 

Alternatively, the plant promoter can direct expression of a polynucleotide 
of present invention in a specific tissue or may be otherwise under more precise 
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- environmental or developmental control. Such promoters are referred to here as 
"inducible" promoters. Environmental conditions that may effect transcription by 
inducible promoters include pathogen attack, anaerobic conditions, or the presence 
of light. Examples of inducible promoters are the Adhl promoter which is 
5 inducible by hypoxia or cold stress, the Hsp70 promoter which is inducible by heat 
stress, and the PPDK promoter which is inducible by light. 

Examples of promoters under developmental control include promoters that 
initiate transcription only, or preferentially, in certain tissues, such as leaves, roots, 
fruit, seeds, or flowers. An exemplary promoter is the anther specific promoter 
10 5126 (U.S. Patent Nos. 5,689,049 and 5,689,051). The operation of a promoter 
may also vary depending on its location in the genome. Thus, an inducible 
promoter may become fully or partially constitutive in certain locations. 

The promoters can be selected based on the desired outcome. When the 
genes are expressed at levels to cause cell death, an inducible promoter or tissue 
1 5 specific promoters can be used to drive the expression of the genes of the 
invention. The inducible promoter must be tightly regulated to prevent 
unnecessary cell death, yet be expressed in the presence of a pathogen to prevent 
infection and disease symptoms. 

Generally, it will be beneficial to express the gene from an inducible promoter, 
20 particularly from a pathogen-inducible promoter. Such promoters include those 
from pathogenesis-related proteins (PR proteins), which are induced following 
infection by a pathogen; e.g., PR proteins, SAR proteins, beta-l,3-glucanase, 
chitinase, etc. See, for example, Redolfi et al (1983) Neth. J. Plant Pathol. 
59:245-254, Uknes et al. (1992) Plant Cell -/:645-656, and Van Loon (1985) Plant 
25 Mol Virol. 4. 1 1 1-1 16. See also the copending application entitled "Inducible 
Maize Promoters", U.S. Application Serial No. 09/257,583, filed February 25, 
1999, herein incorporated by reference. 

Of interest are promoters that are expressed locally at or near the site of 
pathogen infection. See, for example, Marineau et al (1987) Plant Mol Biol 
30 9:335-342; Matton et al (\9%9) Molecular Plant-Microbe Interactions 2:325-33 1, 
Somsisch et al (1986) Proc. NatL Acad. Sci. USA 53:2427-2430; Somsisch et al 
(1988) Mol Gen. Genet. 2:93-98; and Yang (1996) Proc. Natl Acad. Sci. USA 
95:14972-14977. See also, Chen et al (1996) Plant J. 70:955-966, Zhang etal. 
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(1994) Proc. NatL Acad ScL USA 9/ 2507-251 U Warner et al. (1993) Plant J. 
5:191-201; Siebertz <?r a/. (1989) Plant Cell 7:961-968, U.S. Patent No. 5,750,386 
(nematode-inducible), and the references cited therein. Of particular interest is the 
inducible promoter for the maize PRms gene, whose expression is induced by the 
5 pathogen Fitsarium moniliforme (see, for example. Cordero et ai (1992) Physiol 
MoL Plant Path. 41: 189-200). 

Additionally, as pathogens find entry into plants through wounds or insect 
damage, a wound-inducible promoter may be used in the constructions of the 
invention. Such wound-inducible promoters include potato proteinase inhibitor 

10 (pin II) gene (Ryan (1990) Ann. Rev. Phytopath. 25:425-449; Duan et ai (1996) 
Nature Biotechnology /-/:494-498); wunl and wun2, US Patent No 5,428,148, 
winl and win2 (Stanford et al (1989) MoL Gen. Genet. 2/5:200-208), systemin 
(McGurl et al. (1992) Science 225:1570-1573), W1P1 (Rohmeier et al. (1993) 
Plant MoL Biol 22:783-792; Eckelkamp et al. (1993) FEBS Letters 525:73-76), 

15 MP I gene (Corderok et al. (1994) Plant J. 6^:141-150); and the like, herein 
incorporated by reference. 

Chemical-regulated promoters can be used to modulate the expression of a 
gene in a plant through the application of an exogenous chemical regulator. 
Depending upon the objective, the promoter may be a chemical-inducible 

20 promoter, where application of the chemical induces gene expression, or a 

chemical-repressible promoter, where application of the chemical represses gene 
expression Chemical-inducible promoters are known in the an and include, but are 
not limited to, the maize In2-2 promoter, which is activated by 
benzenesulfonamide herbicide safeners, the maize GST promoter, which is 

25 activated by hydrophobic electrophilic compounds that are used as pre-emergent 
herbicides, and the tobacco PR- la promoter, which is activated by salicylic acid. 
Other chemical-regulated promoters of interest include steroid-responsive 
promoters (see, for example, the glucocorticoid-inducible promoter in Schena et al. 
(1991) Proc. NatL Acad Sci. USA 88. 10421-10425 and McNellis et al. (1998) 

30 Plant J. 14(2) .247-257) and tetracycline-inducible and tetracycline-repressible 

promoters (see, for example, Gatz et al. (1991) MoL Gen. Genet. 227:229-237, and 
U.S. Patent Nos. 5,814,618 and 5,789,156), herein incorporated by reference. 
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Where low level expression is desired, weak promoters will be used. 
Generally, by "weak promoter 1 is intended a promoter that drives expression of a 
coding sequence at a low level. By low level is intended at levels of about 1/1000 
transcripts to about 1/100,000 transcripts to about 1/500,000 transcripts. 

5 Alternatively, it is recognized that weak promoters also encompasses promoters 
that are expressed in only a few cells and not in others to give a total low level of 
expression. Where a promoter is expressed at unacceptably high levels, portions of 
the promoter sequence can be deleted or modified to decrease expression levels. 

Such weak constitutive promoters include, for example, the core promoter 

10 of the Rsyn7 (WO 99/43838), the core 35S CaMV promoter, and the like. Other 
constitutive promoters include, for example, U.S. Patent Nos. 5,608,149; 
5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; and 5,608,142. 
See also, the copending application entitled "Constitutive Maize Promoters", U.S. 
Application Serial No. 09/257,584, filed February 25, 1999, and herein 

1 5 incorporated by reference. 

Tissue-preferred promoters can be utilized to target enhanced RPA 
expression within a particular plant tissue. In this aspect of the invention, the 
antisense constructs are useful for tissue-preferred expression. Male or female 
sterility may be affected by use of the antisense constructs with tissue-preferred 

20 promoters Although not a limitation, of particular interest are promoters for male 
sterility. For example, the anther-preferred promoter 5 126 can be used See, for 
example, U.S. Patent Nos. 5,689,049 and 5,689,051, herein incorporated by 
reference. 

Tissue-preferred promoters include Yamamoto et al (1997) Plant J. 

25 72f2;255-265,Kawamatae/a/. (1997) Plant Cell Physiol 38(7) 792-803, Hansen 
etai (1997) Mol. Gen Genet. 25^:337-343; Russell et al. (1997) Transgenic 
Res. 6(7J:157-168;Rineharte/a/. ( 1 996) Plant Physiol 1J2(3):\33 1-1341 ; Van 
Camp et al (1996) Plant Physiol 7/2^:525-535, Canevascini et al (1996) Plant 
Physiol 7/2^:513-524, Yamamoto et al (1994) Plant Cell Physiol 35(5) :773- 

30 778, Lam {\994) Results Probl Cell Differ. 20:181-196, Orozco et al (]993) Plant 
MolBiol 2^:1129-1138; Matsuokae/ a/. (\993) Proc Natl Acad Sa. USA 
90f20j:9586-959O; and Guevara-Garcia et al (1993) Plant J. ^ 495-505. Such 
promoters can be modified, if necessary, for weak expression. 
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Leaf-specific promoters are known in the art. See, for example, Yamamoto 
etal. (1997) Plant J. 72^:255-265, Kwon etal. (1994) Plant Physiol. 105:357- 
67; Yamamoto et al (1994) Plant Cell Physiol. 55(^:773-778; Gotor et al. (1993) 
Plant J. 5:509-18; Orozco etal (1993) Plant Moi Biol. 25(^:1129-1138; and 
5 Matsuoka etal. (1993) Proc. Natl. Acad Sci. USA 90(20) :9586-9590. 

Root-specific promoters are known and can be selected from the many 
available from the literature or isolated de novo from various compatible species 
See, for example, Hire et al (1992) Plant MoL Biol 20(2). 207-218 (soybean root- 
specific glutamine synthetase gene), Keller and Baumgartner (1991) Plant Cell 

10 3(10). 1051-1061 (root-specific control element in the GRP 1 .8 gene of French 

bean); Sanger et al. (1990) Plant MoL Biol. 14(3) : 43 3-443 (root-specific promoter 
of the mannopine synthase (MAS) gene of Agrobactenum tumefaciens)\ and Miao 
et al (1991) Plant Cell 3(l):\\-22 (full-length cDNA clone encoding cytosolic 
glutamine synthetase (GS), which is expressed in roots and root nodules of 

15 soybean). See also Bogusz etal. (1990) Plant Cell 2(7) ;633-641, where two root- 
specific promoters isolated from hemoglobin genes from the nitrogen-fixing 
nonlegume Parasponia andersonu and the related non-nitrogen-fixing nonlegume 
Trema tomentosa are described The promoters of these genes were linked to a (3- 
glucuronidase reporter gene and introduced into both the nonlegume Nicotiana 

20 labacum and the legume Lotus corniculatus, and in both instances root-specific 

promoter activity was preserved. Leach and Aoyagi (1991) describe their analysis 
of the promoters of the highly expressed rolC and rolD root-inducing genes of 
Agrobactenum rhizogenes (see Plant Science (Limerick) 79(7^:69-76). They 
concluded that enhancer and tissue-preferred DNA determinants are dissociated in 

25 those promoters. Teeri et al. (1989) used gene fusion to lacZ to show that the 

Agrobactenum T-DNA gene encoding octopine synthase is especially active in the 
epidermis of the root tip and that the TR2' gene is root specific in the intact plant 
and stimulated by wounding in leaf tissue, an especially desirable combination of 
characteristics for use with an insecticidal or larvicidal gene (see EJS/IBO J. 

30 5(^:343-350). The TRT gene, fused to nptll (neomycin phosphotransferase II) 
showed similar characteristics. Additional root-preferred promoters include the 
VIENOD-GRP3 gene promoter (Kuster et al. (1995) Plant MoL Biol 29(4)1 59- 
772); and rolB promoter (Capanae/a/. (1994) Plant MoL Biol 25^.681-691. See 
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- also U.S. Patent Nos. 5,837,876; 5,750,386; 5,633,363, 5,459,252; 5,401,836; 
5,110,732; and 5,023,179. 

"Seed-preferred" promoters include both "seed-specific" promoters (those 
promoters active during seed development such as promoters of seed storage 
5 proteins) as well as "seed-germinating" promoters (those promoters active during 
seed germination). See Thompson et al. (1989) BioEssays 10: 108, herein 
incorporated by reference. Such seed-preferred promoters include, but are not 
limited to, Ciml (cytokinin-induced message); cZ19Bl (maize 19 kDa zein); milps 
(myo-inositol-1 -phosphate synthase), and celA (cellulose synthase) (see the 

10 copending application entitled "Seed-Preferred Promoters," U.S. Application 
Serial No. 60/097,233, filed August 20, 1998, herein incorporated by reference. 
Gama-zein is a preferred endosperm-specific promoter. Glob-1 is a preferred 
embryo-specific promoter. For dicots, seed-specific promoters include, but are not 
limited to, bean (3-phaseolin, napin, (3-conglycinin, soybean lectin, cruciferin, and 

15 the like. For monocots, seed-specific promoters include, but are not limited to, 
maize 15 kDa zein, 22 kDa zein, 27 kDa zein, g-zein, waxy, shrunken 1, shrunken 
2, globulin 1, etc. 

Both heterologous and non-heterologous (i.e., endogenous) promoters can 
be employed to direct expression of the nucleic acids of the present invention. 

20 These promoters can also be used, for example, in recombinant expression 

cassettes to drive expression of antisense nucleic acids to reduce, increase, or alter 
RPA content and/or composition in a desired tissue, or to generate sterile plants. 
Optionally, RPA nucleic acids from a variety of sources, as discussed above can be 
employed to create male sterile plants. In optional embodiments, the RPA gene or 

25 cDNA is operably linked to an anther-specific promoter such as 5126, as discussed 
above. Preferably, the male sterile plant is maize. 

Thus, in some embodiments, the nucleic acid construct will comprise a 
promoter functional in a plant cell, such as in Zea mays, operably linked to a 
polynucleotide of the present invention. Promoters useful in these embodiments 

30 include the endogenous promoters driving expression of a polypeptide of the 
present invention. 

In some embodiments, isolated nucleic acids which serve as promoter or 
enhancer elements can be introduced in the appropriate position (generally 
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upstream) of a non-heterologous form of a polynucleotide of the present invention 
so as to up or down regulate expression of a polynucleotide of the present 
invention. For example, endogenous promoters can be altered in vivo by mutation, 
deletion, and/or substitution (see, Kmiec, U.S. Patent No. 5,565,350; Zarling et al, 
5 PCT/US93/03868), or isolated promoters can be introduced into a plant cell in the 
proper orientation and distance from a RPA gene so as to control the expression of 
the gene. Gene expression can be modulated under conditions suitable for plant 
growth so as to alter RPA content and/or composition. Thus, the present invention 
provides compositions, and methods for making, heterologous promoters and/or 

10 enhancers operably linked to a native, endogenous (i.e., non-heterologous) form of 
a polynucleotide of the present invention. 

Methods for identifying promoters with a particular expression pattern, in 
terms of e.g., tissue type, cell type, stage of development, and/or environmental 
conditions, are well known in the art. See, e.g., The Maize Handbook, Chapters 

15 114-115, Freeling and Walbot, eds., Springer, New York (1994), Corn and Corn 
Improvement, 3 rd edition, Chapter 6, Sprague and Dudley, eds., American Society 
of Agronomy, Madison, Wisconsin (1988). A typical step in promoter isolation 
methods is identification of gene products that are expressed with some degree of 
specificity in the target tissue. Amongst the range of methodologies are: 

20 differential hybridization to cDNA libraries, subtractive hybridization; differential 
display; differential 2-D protein gel electrophoresis; DNA probe arrays; and 
isolation of proteins known to be expressed with some specificity in the target 
tissue. Such methods are well known to those of skill in the an. Commercially 
available products for identifying promoters are known in the an such as 

25 Clontech's (Palo Alto, CA) Universal Genome Walker Kit. 

For the protein-based methods, it is helpful to obtain the amino acid 
sequence for at least a portion of the identified protein, and then to use the protein 
sequence as the basis for preparing a nucleic acid that can be used as a probe to 
identify either genomic DNA directly, or preferably, to identify a cDNA clone 

30 from a library prepared from the target tissue. Once such a cDNA clone has been 
identified, that sequence can be used to identify the sequence at the 5' end of the 
transcript of the indicated gene. For differential hybridization, subtractive 
hybridization and differential display, the nucleic acid sequence identified as 
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enriched in the target tissue is used to identify the sequence at the 5' end of the 
transcript of the indicated gene. Once such sequences are identified, starting either 
from protein sequences or nucleic acid sequences, any of these sequences 
identified as being from the gene transcript can be used to screen a genomic library 
5 prepared from the target organism. Methods for identifying and confirming the 
transcriptional start site are well known in the art. 

In the process of isolating promoters expressed under particular 
environmental conditions or stresses, or in specific tissues, or at particular 
developmental stages, a number of genes are identified that are expressed under the 

10 desired circumstances, in the desired tissue, or at the desired stage. Further 

analysis will reveal expression of each particular gene in one or more other tissues 
of the plant. One can identify a promoter with activity in the desired tissue or 
condition but that do not have activity in any other common tissue. 

To identify the promoter sequence, the 5* portions of the clones described 

1 5 here are analyzed for sequences characteristic of promoter sequences. For 

instance, promoter sequence elements include the TATA box consensus sequence 
(TATAAT), which is usually an AT-rich stretch of 5-10 bp located approximately 
20 to 40 base pairs upstream of the transcription start site. Identification of the 
TATA box is well known in the art. For example, one way to predict the location 

20 of this element is to identify the transcription start site using standard 

RNA-mapping techniques such as primer extension, SI analysis, and/or RNase 
protection. To confirm the presence of the AT-rich sequence, a structure-function 
analysis can be performed involving mutagenesis of the putative region and 
quantification of the mutation's effect on expression of a linked downstream 

25 reporter gene. See, e.g., The Maize Handbook, Chapter 1 14, Freeling and Walbot, 
eds., Springer, New York (1994). 

In plants, further upstream from the TATA box, at positions -80 to -100, 
there is typically a promoter element (i.e., the CAAT box) with a series of adenines 
surrounding the trinucleotide G (or T) N G. J. Messing et a/., in Genetic 

30 Engineering in Plants, Kosage, Meredith and Hoilaender, eds., pp. 221-227 (1983). 
In maize, there no well-conserved CAAT box but there are several short, 
conserved protein-binding motifs upstream of the TATA box. These include 
motifs for the transacting transcription factors involved in light regulation, 
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anaerobic induction, hormonal regulation, or anthocyanin biosynthesis, as 
appropriate for each gene. 

Once promoter and/or gene sequences are known, a region of suitable size 
is selected from the genomic DNA that is 5' to the transcriptional start, or the 
5 translational start site, and such sequences are then linked to a coding sequence. If 
the transcriptional start site is used as the point of fusion, any of a number of 
possible 5' untranslated regions can be used in between the transcriptional start site 
and the partial coding sequence. If the translational start site at the 3' end of the 
specific promoter is used, then it is linked directly to the methionine start codon of 

1 0 a coding sequence. 

If polypeptide expression is desired, it is generally desirable to include 
apolyadenylation region at the 3'-end of a polynucleotide coding region. The 
polyadenylation region can be derived from the natural gene, from a variety of 
other plant genes, or from T-DNA. The 3' end sequence to be added can be 

15 derived from, example, the nopaline synthase or octopine synthase genes, or 

alternatively from another plant gene, or less preferably from any other eukaryotic 
gene. 

An intron sequence can be added to the 5' untranslated region or the coding 
sequence of the partial coding sequence to increase the amount of the mature 

20 message that accumulates in the cytosol. Inclusion of a spliceable intron in the 

transcription unit in both plant and animal expression constructs has been shown to 
increase gene expression at both the mRNA and protein leveis up to 1000-fold. 
Buchman et al. (1988) MoL Cell Biol. 5:4395-4405, Callis el al. (1987) Genes 
Dev. 1 . 1 183-1200 Such intron enhancement of gene expression is typically 

25 greatest when placed near the 5' end of the transcription unit. Use of maize introns 
Adhl-S intron 1, 2, and 6, the Bronze-1 intron are known in the art. See generally, 
The Maize Handbook, Chapter 1 16, Freeling and Walbot, eds., Springer, New 
York (1994). 

The vector comprising the sequences from a polynucleotide of the present 
30 invention could comprise a selectable marker gene for the selection of transformed 
cells or tissues. Selectable marker genes include genes encoding antibiotic resistance, 
such as those encoding neomycin phosphotransferase II (NEO) and hygromycin 
phosphotransferase (HPT), as well as genes conferring resistance to herbicidal 
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compounds, such as glufosinate ammonium, bromoxynil, imidazolinones, and 2,4- 
dichlorophenoxyacetate (2,4-D). See generally, Yarranton (1992) Curr. Opin. 
Biotech. 5:506-51 1; Christopherson et al (1992) Proc. Natl Acad Sci USA 59:6314- 
6318, Yao et al. (1992) Cell 77:63-72; Reznikoff (1992)M?/. Microbiol 6:2419- 
5 2422; Bark ley etai (1980) in The Operon, pp. 177-220; Hu etai (1987) Cell 48:555- 
566; Brown etai (1987) Cell ^9:603-612, Figge etai (1988) Cell 52:71 3-722; 
Deuschle el al. (1989) Proc. Natl Acad Aci. USA 56:5400-5404; Fuerst et al (1989) 
Proc. Natl Acad Sa. USA 56:2549-2553; Deuschle et al (1990) Science 2-/5480- 
483, Gossen (1993) Ph.D. Thesis, University of Heidelberg; Reines et al (1993) 

10 Proc. Natl. Acad. Set. USA 90:1917-1921, Labow etai (\990)Mol Cell Biol 
70:3343-3356, Zambretti etai (1992) Proc. Natl Acad Sci. USA 59:3952-3956; 
Bairn etai (1991) Proc. Natl. Acad Sci. USA 55:5072-5076, Wyborski etai (1991) 
Nucleic Acids Res. 79:4647-4653, Hillenand-Wissman ( 1989) Topics Mol Struc. 
Biol 70:143-162, Degenkolb et al (1991) Amimicrob. Agents Chemother. 35:1591- 

15 1595; Kleinschnidt et al (1988) Biochemistry 27: 1094-1 104, Bonin (1993) Ph.D. 
Thesis, University of Heidelberg, Gossen et al (1992) Proc. Natl Acad Sci. USA 
59:5547-5551, Olivae^a/. (1992) Antimicroh. Agents Chemother. 36 913-919; 
Hlavka et al (1985) Handbook of Experimental Pharmacology, Vol. 78 ( Springer- 
Verlag, Berlin); Gill et al. (1988) Nature 334: 72 1-724. Such disclosures are herein 

20 incorporated by reference. 

The above list of selectable marker genes is not meant to be limiting. Any 
selectable marker gene can be used in the present invention. 

Typical vectors useful for expression of genes in higher plants are well 
known in the art and include vectors derived from the tumor-inducing (Ti) plasmid 

25 of Agrobacterium tumefaciens described by Rogers et al. ( 1 987) Meth. in Enzymol 
755:253-277 These vectors are plant integrating vectors in that on transformation, 
the vectors integrate a portion of vector DNA into the genome of the host plant. 
Exemplary A. tumefaciens vectors useful herein are plasmids pKYLX6 and 
pKYLX7 of Schardl et al. (1987) Gene 67:1-1 1 and Berger et al. (1989) Proc. 

30 Natl Acad. Sci. (USA) 56:8402-8406. Another useful vector herein is plasmid 
pBI101.2 that is available from Clontech Laboratories, Inc. (Palo Alto, CA). 

As discussed above, a polynucleotide of the present invention can be 
expressed in either sense or antisense orientation as desired. It will be appreciated 
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that control of gene expression in either sense or antisense orientation can have a 
direct impact on the observable plant characteristics. Antisense technology can be 
conveniently used for gene expression in plants. To accomplish this, a nucleic acid 
segment from the desired gene is cloned and operably linked to a promoter such 
5 that the antisense strand of RNA will be transcribed The construct is then 
transformed into plants and the antisense strand of RNA is produced. In plant 
cells, it has been shown that antisense RNA inhibits gene expression by preventing 
the accumulation of mRNA which encodes the enzyme of interest, see, e.g., 
Sheehy et al (1988) Proc. Nad. Acad ScL (USA) £5:8805-8809; and Hiatt et aL, 

10 U.S. Patent No. 4,801,340. 

In the methods of the invention, it is recognized that the entire coding 
sequence for the RPA construct may be utilized Alternatively, portions or 
fragments of the sequence may be used in DNA constructs. 

Fragments and variants of the disclosed nucleotide sequences and proteins 

15 encoded thereby are encompassed by the present invention. By "fragment" is 
intended a portion of the nucleotide sequence or a portion of the amino acid 
sequence and hence protein encoded thereby. Fragments of a nucleotide sequence 
may encode protein fragments that retain the biological activity of the native 
protein and hence modulate DNA metabolism. Alternatively, fragments of a 

20 nucleotide sequence that are useful as hybridization probes generally do not encode 
fragment proteins retaining biological activity Thus, fragments of a nucleotide 
sequence may range from at least about 20 nucleotides, about 50 nucleotides, about 
100 nucleotides, and up to the full-length nucleotide sequence encoding the 
proteins of the invention. 

25 A fragment of a RPA nucleotide sequence that encodes a biologically 

active portion of a RPA protein of the invention will encode at least 15, 25, 30, 50, 
100, 150, 200, or 250 contiguous amino acids, or up to the total number of amino 
acids present in a full-length RPA protein of the invention (for example, 623, 617, 
273, 273, 273, 318, 273, 273 amino acids for SEQ IDNOs: 2. 4, 12, 14, 16, 18, 

30 20, and 22 respectively. Fragments of a RPA nucleotide sequence that are useful 
as hybridization probes for PCR primers generally need not encode a biologically 
active portion of a RPA protein. 
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Thus, a fragment of a RPA nucleotide sequence may encode a biologically 
active portion of a RPA protein, or it may be a fragment that can be used as a 
hybridization probe or PCR primer using methods disclosed below A biologically 
active portion of a RPA protein can be prepared by isolating a portion of one of the 

5 RPA nucleotide sequences of the invention, expressing the encoded portion of the 
RPA protein (e.g., by recombinant expression in vitro), and assessing the activity 
of the encoded portion of the RPA protein. Nucleic acid molecules that are 
fragments of a RPA nucleotide sequence comprise at least 16, 20, 50, 75, 100, 150, 
200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 800, 900, 1,000 nucleotides, 

10 or up to the number of nucleotides present in a full-length RPA nucleotide 

sequence disclosed herein (for example, 2497, 2202, 1 124, 979, 1051, 1087, 1074, 
and 1231 nucleotides for SEQ IDNOs: 1,3, 11, 13, 15, 17, 19, and 21 
respectively). 

By "variants' 1 is intended substantially similar sequences. For nucleotide 

15 sequences, conservative variants include those sequences that, because of the 

degeneracy of the genetic code, encode the amino acid sequence of one of the RPA 
polypeptides of the invention. Such naturally occurring variants including 
naturally occurring allelic variants, can be identified with the use of well-known 
molecular biology techniques, as, for example, with polymerase chain reaction 

20 (PCR) and hybridization techniques as outlined below. Variant nucleotide 

sequences also include synthetically derived nucleotide sequences, such as those 
generated, for example, by using site-directed mutagenesis but which still encode a 
RPA protein of the invention. Generally, variants of a particular nucleotide 
sequence of the invention will have at least 40%, 50%, 60%, 70%, generally at 

25 least 75%, 80%, 85%, preferably about 90% to 95% or more, and more preferably 
about 98% or more sequence identity to that particular nucleotide sequence as 
determined by sequence alignment programs described elsewhere herein using 
default parameters. 

By "variant" protein is intended a protein derived from the native protein 

30 by deletion (so-called truncation) or addition of one or more amino acids to the N- 
terminal and/or C-terminal end of the native protein; deletion or addition of one or 
more amino acids at one or more sites in the native protein; or substitution of one 
or more amino acids at one or more sites in the native protein. Variant proteins 
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encompassed by the present invention are biologically active, that is they continue 
to possess the desired biological activity of the native protein, that is, modulating 
DNA metabolism as described herein. Such variants may result from, for example, 
genetic polymorphism or from human manipulation. Biologically active variants of 
5 a native RPA protein of the invention will have at least 40%, 50%, 60%, 70%, 
generally at least 75%, 80%, 85%, preferably about 90% to 95% or more, and 
more preferably about 98% or more sequence identity to the amino acid sequence 
for the native protein as determined by sequence alignment programs described 
elsewhere herein using default parameters. A biologically active variant of a 

10 protein of the invention may differ from that protein by as few as 1-15 amino acid 
residues, as few as 1-10, such as 6-10, as few as 5, as few as 4, 3, 2, or even 1 
amino acid residue. 

The proteins of the invention may be altered in various ways including 
amino acid substitutions, deletions, truncations, and insertions. Methods for such 

15 manipulations are generally known in the art For example, amino acid sequence 
variants of the RPA proteins can be prepared by mutations in the DNA. Methods 
for mutagenesis and nucleotide sequence alterations are well known in the art. 
See, for example, Kunkel (1985) Proc. Nati Acad Set. USA 52:488-492, Kunkei 
etal (1987) Methods in Enzymoi 75^:367-382, US Patent No. 4,873,192; Walker 

20 and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan Publishing 
Company, New York) and the references cited therein Guidance as to appropriate 
amino acid substitutions that do not affect biological activity of the protein of 
interest may be found in the model of Day ho ff et al. (1978) Alias of Protein 
Sequence and Structure (Natl. Biomed. Res. Found., Washington, D C ), herein 

25 incorporated by reference. Conservative substitutions, such as exchanging one 
amino acid with another having similar properties, may be preferred. 

Thus, the genes and nucleotide sequences of the invention include both the 
naturally occurring sequences as well as mutant forms. Likewise, the proteins of 
the invention encompass both naturally occurring proteins as well as variations and 

30 modified forms thereof. Such variants will continue to possess the desired activity 
in influencing DNA metabolism. Obviously, the mutations that will be made in the 
DNA encoding the variant must not place the sequence out of reading frame and 
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- preferably will not create complementary regions that could produce secondary 
mRNA structure. See, EP Patent Application Publication No. 75,444. 

The deletions, insertions, and substitutions of the protein sequence 
encompassed herein are not expected to produce radical changes in the 

5 characteristics of the protein. However, when it is difficult to predict the exact 
effect of the substitution, deletion, or insertion in advance of doing so, one skilled 
in the art will appreciate that the effect will be evaluated by routine screening 
assays. That is, the activity can be evaluated by assessing DNA binding, 
recombination, repair and replication. See, for example, Braun et al (1997) 

10 Biochemistry 36:8443-8454; Longhese et al. (1994) Molecular and Cellular 
Biology 77:7884-7890; Stigger et ai (1998)7. Biol. Chem. 275:9337-9343; 
Abremova et ai (1997) Proc. Natl Acad ScL USA 97:7186-7191; New et al. 
(1998) Nature 397:407-410, Bochkareva et al. (1998) J. Biol Chem. 2730:3932- 
6Mass et al. (1998) Moi Cell Biol. /S(7/j:6399-407, Lavrik et al. (1998) Nucleic 

15 Acids Res 26(2)\6Q2-1\ Sibenaller et al. (1998) 37(3^:12496-506; Matsunaga et 
ai (1996) J. Biol Chem. 271 (19). 1 1047-50; and Sung (1997) Genes & 
Development 11: 1111-21, herein incorporated by reference. 

Variant nucleotide sequences and proteins also encompass nucleotide 
sequences and proteins derived from a mutagenic and recombinogenic procedure 

20 such as DNA shuffling. With such a procedure, one or more different RPA coding 
sequences can be manipulated to create a new RPA possessing the desired 
properties. In this manner, libraries of recombinant polynucleotides are generated 
from a population of related sequence polynucleotides comprising sequence 
regions that have substantial sequence identity and can be homologously 

25 recombined in vitro or in vivo. For example, using this approach, sequence motifs 
encoding a domain of interest may be shuffled between the RPA gene of the 
invention and other known RPA genes to obtain a new gene coding for a protein 
with an improved property of interest, such as an increased K m in the case of an 
enzyme. Strategies for such DNA shuffling are known in the an. See, for 

30 example, Stemmer (1994) Proc. Natl Acad. Sci. USA 91: 10747-10751; Stemmer 
(1994) Nature 370:389-391, Crameri et al (1997) Nature Biotech. /5:436-438, 
Moore et al (1997) 1 Mol Biol 272:336-341, Zhang et al (1997) Proc Natl 
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Acad Sci. USA 94:4504-4509; Crameri et al. (1998) Nature 597:288-291, and U.S. 
Patent Nos. 5,605,793 and 5,837,458. 

It is recognized that with these nucleotide sequences, antisense 
constructions, complementary to at least a portion of the messenger RNA (mRNA) 
5 for the RPA sequences can be constructed. Antisense nucleotides are constructed 
to hybridize with the corresponding mRNA. Modifications of the antisense 
sequences may be made as long as the sequences hybridize to and interfere with 
expression of the corresponding mRNA. In this manner, antisense constructions 
having 70%, preferably 80%, more preferably 85% sequence similarity to the 

10 corresponding antisense sequences may be used. Furthermore, portions of the 
antisense nucleotides may be used to disrupt the expression of the target gene. 
Generally, sequences of at least 50 nucleotides, 100 nucleotides, 200 nucleotides, 
or greater may be used 

The nucleotide sequences of the present invention may also be used in the 

15 sense orientation to suppress the expression of endogenous genes in plants. 

Methods for suppressing gene expression in plants using nucleotide sequences in 
the sense orientation are known in the an. The methods generally involve 
transforming plants with a DNA construct comprising a promoter that drives 
expression in a plant operably linked to at least a portion of a nucleotide sequence 

20 that corresponds to the transcript of the endogenous gene. Typically, such a 
nucleotide sequence has substantial sequence identity to the sequence of the 
transcript of the endogenous gene, preferably greater than about 65% sequence 
identity, more preferably greater than about 85% sequence identity, most 
preferably greater than about 95% sequence identity. See, U.S. Patent Nos. 

25 5,283,184 and 5,034,323; herein incorporated by reference. 

Use of the polypeptides and proteins, and fragments and variants thereof, 
for producing antibodies are also encompassed by the invention. The invention 
also encompasses using such antibodies to determine RPA protein levels, and to 
modulate one or more biological activities or interactions of RPA. Methods for the 

30 production of antibodies are known in the art. See, for example, Harlow and Lane, 
antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York 
(1988); and the reference is cited therein. 
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The RPA sequences of the invention may be optimized for enhanced 
expression in plants of interest. See, for example, EPA0359472; W091/16432; 
Perlak et al. (1991) Proc. Natl. Acad. Sci. USA 55:3324-3328; and Murray et al. 
(1989) Nucleic Acids Res. 77:477-498. In this manner, the genes can be 
5 synthesized utilizing plant-preferred condons. See, for example, Murray et al. 
(1989) Nucleic Acids Res. 77:477-498, the disclosure of which is incorporated 
herein by reference. In this manner, synthetic genes can also be made based on the 
distribution of codons a particular host uses for a particular amino acid. Thus, the 
nucleotide sequences can be optimized for expression in any plant. It is recognized 

10 that all or any part of the gene sequence may be optimized or synthetic. That is, 
synthetic or partially optimized sequences may also be used. 

Thus nucleotide sequences of the invention and the proteins encoded 
thereby include the native forms as well as variants thereof. The variant proteins 
will be substantially homologous and functionally equivalent to the native proteins. 

15 A variant of a native protein is "substantially homologous" to the native protein 
when at least about 80%, more preferably at least about 90%, and most preferably 
at least about 95% of its amino acid sequence is identical to the amino acid 
sequence of the native protein. By "functionally equivalent" is intended that the 
sequence of the variant defines a chain that produces a protein having substantially 

20 the same biological effect as the native protein of interest. Such functionally 
equivalent variants that comprise substantial sequence variations are also 
encompassed by the invention. 

The nucleotide sequences of the invention can be used to isolate 
corresponding sequences from other organisms, particularly other plants, more 

25 particularly other monocots. In this manner, methods such as PGR, hybridization, 
and the like can be used to identify such sequences based on their sequence 
homology to the sequence set forth herein. Sequences isolated based on their 
sequence identity to the entire RPA sequences set forth herein or to fragments 
thereof are encompassed by the present invention. 

30 In a PCR approach, oligonucleotide primers can be designed for use in PCR 

reactions to amplify corresponding DNA sequences from cDNA or genomic DNA 
extracted from any plant of interest. Methods for designing PCR primers and PCR 
cloning are generally known in the art and are disclosed in Sambrook et al. (1989) 
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Molecular Cloning: A Laboratory Manual (2d e<±, Cold Spring Harbor Laboratory 
Press, Plainview, New York). See also Innis et al, eds. (1990) PCR Protocols: A 
Guide to Methods and Applications (Academic Press, New York); Innis and 
Gelfand, eds. (1995) PCR Strategies (Academic Press, New York); and Innis and 
5 Gelfand, eds. (1999) PCR Methods Manual (Academic Press, New York). Known 
methods of PCR include, but are not limited to, methods using paired primers, 
nested primers, single specific primers, degenerate primers, gene-specific primers, 
vector-specific primers, partially-mismatched primers, and the like. 

In hybridization techniques, all or part of a known nucleotide sequence is 

10 used as a probe that selectively hybridizes to other corresponding nucleotide 
sequences present in a population of cloned genomic DNA fragments or cDNA 
fragments (i.e., genomic or cDNA libraries) from a chosen organism. The 
hybridization probes may be genomic DNA fragments, cDNA fragments, RNA 
fragments, or other oligonucleotides, and may be labeled with a detectable group 

15 such as 32 P, or any other detectable marker. Thus, for example, probes for 

hybridization can be made by labeling synthetic oligonucleotides based on the 
RPA sequences of the invention. Methods for preparation of probes for 
hybridization and for construction of cDNA and genomic libraries are generally 
known in the an and are disclosed in Sambrook et al. ( 1 989) Molecular Cloning: A 

20 Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, New 
York). 

For example, the entire RPA sequence disclosed herein, or one or more 
portions thereof, may be used as a probe capable of specifically hybridizing to 
corresponding RPA sequences and messenger RNAs. To achieve specific 

25 hybridization under a variety of conditions, such probes include sequences that are 
unique among RPA sequences and are preferably at least about 10 nucleotides in 
length, and most preferably at least about 20 nucleotides in length. Such probes 
may be used to amplify corresponding RPA sequences from a chosen plant by 
PCR. This technique may be used to isolate additional coding sequences from a 

30 desired plant or as a diagnostic assay to determine the presence of coding 

sequences in a plant Hybridization techniques include hybridization screening of 
plated DNA libraries (either plaques or colonies; see, for example, Sambrook et al. 
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(1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor 
Laboratory Press, Plainview, New York). 

Hybridization of such sequences may be carried out under stringent 
conditions. By "stringent conditions" or "stringent hybridization conditions" is 
5 intended conditions under which a probe will hybridize to its target sequence to a 
detectably greater degree than to other sequences (e.g., at least 2-fold over 
background). Stringent conditions are sequence-dependent and will be different in 
different circumstances. By controlling the stringency of the hybridization and/or 
washing conditions, target sequences that are 100% complementary to the probe 

10 can be identified (homologous probing). Alternatively, stringency conditions can 
be adjusted to allow some mismatching in sequences so that lower degrees of 
similarity are detected (heterologous probing). Generally, a probe is less than 
about 1000 nucleotides in length, preferably less than 500 nucleotides in length. 

Typically, stringent conditions will be those in which the salt concentration 

1 5 is less than about 1 .5 M Na ion, typically about 0.01 to 1 .0 M Na ion concentration 
(or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short 
probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., 
greater than 50 nucleotides). Stringent conditions may also be achieved with the 
addition of destabilizing agents such as formamide. Exemplary low stringency 

20 conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 
M NaCl, 1% SDS (sodium dodecyl sulphate) at 37°C, and a wash in IX to 2X SSC 
(20X SSC = 3.0 M NaCl/0.3 M tnsodium citrate) at 50 to 55°C Exemplary 
moderate stringency conditions include hybridization in 40 to 45% formamide, 1 .0 
M NaCl, 1% SDS at 37°C, and a wash in 0.5X to IX SSC at 55 to 60°C. 

25 Exemplary high stringency conditions include hybridization in 50% formamide, 1 
M NaCl, 1% SDS at 37°C, and a wash in 0. IX SSC at 60 to 65°C. 

Specificity is typically the function of post-hybridization washes, the 
critical factors being the ionic strength and temperature of the final wash solution. 
For DNA-DNA hybrids, the T m can be approximated from the equation of 

30 Meinkoth and Wahl (1984) Anal Biochem. 735:267-284: T m - 81.5°C + 16.6 (log 
M) + 0.41 (%GC) - 0.61 (% form) - 500/L, where M is the molarity of monovalent 
cations, %GC is the percentage of guanosine and cytosine nucleotides in the DNA, 
% form is the percentage of formamide in the hybridization solution, and L is the 
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length of the hybrid in base pairs. The T m is the temperature (under defined ionic 
strength and pH) at which 50% of a complementary target sequence hybridizes to a 
perfectly matched probe. is reduced by about 1°C for each 1% of mismatching; 
thus, T m , hybridization, and/or wash conditions can be adjusted to hybridize to 
5 sequences of the desired identity. For example, if sequences with >90% identity 
are sought, the T m can be decreased 10°C. Generally, stringent conditions are 
selected to be about 5°C lower than the thermal melting point (T m ) for the specific 
sequence and its complement at a defined ionic strength and pH. However, 
severely stringent conditions can utilize a hybridization and/or wash at 1, 2, 3, or 

10 4°C lower than the thermal melting point (T m ), moderately stringent conditions can 
utilize a hybridization and/or wash at 6, 7 y 8, 9, or 10°C lower than the thermal 
melting point (T m ); low stringency conditions can utilize a hybridization and/or 
wash at 11, 12, 13, 14, 15, or 20°C lower than the thermal melting point (T m ). 
Using the equation, hybridization and wash compositions, and desired T m , those of 

15 ordinary skill will understand that variations in the stringency of hybridization 
and/or wash solutions are inherently described. If the desired degree of 
mismatching results in a T m of less than 45°C (aqueous solution) or 32°C 
(formamide solution), it is preferred to increase the SSC concentration so that a 
higher temperature can be used. An extensive guide to the hybridization of nucleic 

20 acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and 
Molecular Biology— Hybridization with Nucleic Acid Probes, Pan I, Chapter 2 
(Elsevier, New York); and Ausubel et aL, eds. (1995) Current Protocols in 
Molecular Biology, Chapter 2 (Greene Publishing and Wiley-lnterscience, New 
York). See Sambrook et ai (1989) Molecular Cloning: A Laboratory Manual (2d 

25 ed., Cold Spring Harbor Laboratory Press, Plainview, New York). 

Thus, isolated sequences that have promoter activity or encode for a RPA 
protein and which hybridize under stringent conditions to the RPA sequences 
disclosed herein, or to fragments thereof, are encompassed by the present 
invention. Such sequences will be at least 40% to 50% homologous, about 60% to 

30 70% homologous, and even about 75%, 80%, 85%, 90%, 95% to 98% homologous 
or more with the disclosed sequences. That is, the sequence identity of sequences 
may range, sharing at least 40% to 50%, about 60% to 70%, and even about 75%, 
80%, 85%, 90%, 95% to 98% or more sequence identity. 



WO 00/15816 



PCT/US99/21277 



The following terms are used to describe the sequence relationships 
between two or more nucleic acids or polynucleotides: (a) "reference sequence", 
(b) "comparison window", (c) "sequence identity", (d) "percentage of sequence 
identity", and (e) "substantial identity". 
5 (a) As used herein, "reference sequence" is a defined sequence used as 

a basis for sequence comparison. A reference sequence may be a subset or the 
entirety of a specified sequence; for example, as a segment of a full-length cDNA 
or gene sequence, or the complete cDNA or gene sequence. 

(b) As used herein, "comparison window" makes reference to a 
10 contiguous and specified segment of a polynucleotide sequence, wherein the 
polynucleotide sequence in the comparison window may comprise additions or 
deletions (i.e., gaps) compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences Generally, the 
comparison window is at least 20 contiguous nucleotides in length, and optionally 
15 can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid 
a high similarity to a reference sequence due to inclusion of gaps in the 
polynucleotide sequence a gap penalty is typically introduced and is subtracted 
from the number of matches. 

Methods of alignment of sequences for comparison are well known in the 
20 an. Thus, the determination of percent identity between any two sequences can be 
accomplished using a mathematical algorithm Preferred, non-limiting examples 
of such mathematical algorithms are the algorithm of Myers and Miller (1988) 
CABIOS 4\\\A1\ the local homology algorithm of Smith et al. {\9%\)Adv. Appi 
Math. 2:482; the homology alignment algorithm of Needleman and Wunsch (1970) 
25 J. Mol. Biol. ^5:443-453, the search-for-similarity-method of Pearson and Lipman 
(1988) Proc. Natl. Acad Sci. 55:2444-2448; the algorithm of Karlin and Altschul 
(1990) Proc. Natl Acad. Sci. USA 572264, modified as in Karlin and Altschul 
(1993) Proc. Natl. Acad Sci. USA 90:5873-5877. 

Computer implementations of these mathematical algorithms can be 
30 utilized for comparison of sequences to determine sequence identity. Such 
implementations include, but are not limited to: CLUSTAL in the PC/Gene 
program (available from Intelligenetics, Mountain View, California), the ALIGN 
program (Version 2.0) and GAP, BESTFIT, BLAST, FASTA, and TFASTA in the 
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Wisconsin Genetics Software Package, Version 8 (available from Genetics 
Computer Group (GCG), 575 Science Drive, Madison, Wisconsin, USA). 
Alignments using these programs can be performed using the default parameters. 
The CLUSTAL program is well described by Higgins et al (1988) Gene 73:237- 
5 244 (1988); Higgins et al (1989) CABIOS 5: 151-153; Corpet et al (1988) Nucleic 
Acids Res 76:10881-90; Huang et al (1992) CABIOS 8. \S5-6S\ and Pearson et al 
(1994) Meth. MoL Biol. 24:307-331. The ALIGN program is based on the 
algorithm of Myers and Miller (1988) supra. A P AM 120 weight residue table, a 
gap length penalty of 12, and a gap penalty of 4 can be used with the .ALIGN 

10 program when comparing amino acid sequences. The BLAST programs of 

Altschul et al (1990) 1 MoL Biol 2 15: 403 are based on the algorithm of Karlin 
and Altschul (1990) supra. BLAST nucleotide searches can be performed with the 
BLASTN program, score = 100, wordlength = 12, to obtain nucleotide sequences 
homologous to a nucleotide sequence encoding a protein of the invention. BLAST 

15 protein searches can be performed with the BLASTX program, score = 50, 
wordlength = 3, to obtain amino acid sequences homologous to a protein or 
polypeptide of the invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST (in BLAST 2.0) can be utilized as described in Altschul 
et al (1997) Nucleic Acids Res. 25:3389. Alternatively, PSI-BLAST (in BLAST 

20 2.0) can be used to perform an iterated search that detects distant relationships 
between molecules. See Altschul et al (1997) supra. When utilizing BLAST, 
Gapped BLAST, PSI-BLAST, the default parameters of the respective programs 
(e.g., BLASTN for nucleotide sequences, BLASTX for proteins) can be used. See 
http://www ncbi.nlm.nih.gov Alignment may also be performed manually by 

25 inspection. Alignment may also be performed manually by inspection. 

For purposes of the present invention, comparison of nucleotide or protein 
sequences for determination of percent sequence identity to the RPA sequences 
disclosed herein is preferably made using the GCG PileUp program, version 10.00, 
with its default parameters or any equivalent program. By "equivalent program" is 

30 intended any sequence comparison program that, for any two sequences in 

question, generates an alignment having identical nucleotide or amino acid residue 
matches and an identical percent sequence identity when compared to the 
corresponding alignment generated by the preferred program. 
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(c) As used herein, "sequence identity 11 or "identity" in the context of 
two nucleic acid or polypeptide sequences makes reference to the residues in the 
two sequences that are the same when aligned for maximum correspondence over a 
specified comparison window. When percentage of sequence identity is used in 
5 reference to proteins it is recognized that residue positions which are not identical 
often differ by conservative amino acid substitutions, where amino acid residues 
are substituted for other amino acid residues with similar chemical properties (e.g., 
charge or hydrophobicity) and therefore do not change the functional properties of 
the molecule. When sequences differ in conservative substitutions, the percent 

10 sequence identity may be adjusted upwards to correct for the conservative nature 
of the substitution. Sequences that differ by such conservative substitutions are 
said to have "sequence similarity 1 ' or "similarity" Means for making this 
adjustment are well known to those of skill in the art. Typically this involves 
scoring a conservative substitution as a partial rather than a full mismatch, thereby 

15 increasing the percentage sequence identity Thus, for example, where an identical 
amino acid is given a score of 1 and a non-conservative substitution is given a 
score of zero, a conservative substitution is given a score between zero and 1 , The 
scoring of conservative substitutions is calculated, e.g., as implemented in the 
program PC/GENE (Intelligenetics, Mountain View, California). 

20 (d) As used herein, "percentage of sequence identity" means the value 

determined by comparing two optimally aligned sequences over a comparison 
window, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (i.e., gaps) as compared to the 
reference sequence (which does not comprise additions or deletions) for optimal 

25 alignment of the two sequences. The percentage is calculated by determining the 
number of positions at which the identical nucleic acid base or amino acid residue 
occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the window of 
comparison, and multiplying the result by 100 to yield the percentage of sequence 

30 identity. 

(e)(i) The term "substantial identity" of polynucleotide sequences means 
that a polynucleotide comprises a sequence that has at least 70% sequence identity, 
preferably at least 80%, more preferably at least 90%, and most preferably at least 
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95%, compared to a reference sequence using one of the alignment programs 
described using standard parameters. One of skill in the art will recognize that 
these values can be appropriately adjusted to determine corresponding identity of 
proteins encoded by two nucleotide sequences by taking into account codon 
5 degeneracy, amino acid similarity, reading frame positioning, and the like. 

Substantial identity of amino acid sequences for these purposes normally means 
sequence identity of at least 60%, more preferably at least 70%, 80%, 90%, and 
most preferably at least 95%. 

Another indication that nucleotide sequences are substantially identical is if 

10 two molecules hybridize to each other under stringent conditions. Generally, 
stringent conditions are selected to be about 5°C lower than the thermal melting 
point (T m ) for the specific sequence at a defined ionic strength and pH. However, 
stringent conditions encompass temperatures in the range of about 1°C to about 
20°C, depending upon the desired degree of stringency as otherwise qualified 

15 herein. Nucleic acids that do not hybridize to each other under stringent conditions 
are still substantially identical if the polypeptides they encode are substantially 
identical. This may occur, e.g., when a copy of a nucleic acid is created using the 
maximum codon degeneracy permitted by the genetic code. One indication that 
two nucleic acid sequences are substantially identical is when the polypeptide 

20 encoded by the first nucleic acid is immunologically cross reactive with the 
polypeptide encoded by the second nucleic acid. 

(e)(ii) The term "substantial identity 1 ' in the context of a peptide indicates 
that a peptide comprises a sequence with at least 70% sequence identity to a 
reference sequence, preferably 80%, more preferably 85%, most preferably at least 

25 90% or 95% sequence identity to the reference sequence over a specified 
comparison window. Preferably, optimal alignment is conducted using the 
homology alignment algorithm of Needleman et al (1970) J. Moi Biol. 48A43. 
An indication that two peptide sequences are substantially identical is that one 
peptide is immunologically reactive with antibodies raised against the second 

30 peptide. Thus, a peptide is substantially identical to a second peptide, for example, 
where the two peptides differ only by a conservative substitution. Peptides that are 
"substantially similar" share sequences as noted above except that residue 
positions that are not identical may differ by conservative amino acid changes. 
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Using the nucleic acids of the present invention, one may express a protein 
of the present invention in a recombinantiy engineered cell such as bacteria, yeast, 
insect, mammalian, or preferably plant ceils. The cells produce the protein in a 
non-natural condition (e.g., in quantity, composition, location, and/or time), 
5 because they have been genetically altered through human intervention to do so. 
It is expected that those of skill in the art are knowledgeable in the 
numerous expression systems available for expression of a nucleic acid encoding a 
protein of the present invention. No attempt to describe in detail the various 
methods known for the expression of proteins in prokaryotes or eukaryotes will be 
1 0 made. 

In brief summary, the expression of isolated nucleic acids encoding a 
protein of the present invention will typically be achieved by operably linking, for 
example, the DNA or cDNA to a promoter (which is either constitutive or 
inducible), followed by incorporation into an expression vector. The vectors can 

1 5 be suitable for replication and integration in either prokaryotes or eukaryotes. 
Typical expression vectors contain transcription and translation terminators, 
initiation sequences, and promoters useful for regulation of the expression of the 
DNA encoding a protein of the present invention. To obtain high level expression 
of a cloned gene, it is desirable to construct expression vectors which contain, at 

20 the minimum, a strong promoter to direct transcription, a ribosome binding site for 
translational initiation, and a transcription/translation terminator. One of skill 
would recognize that modifications can be made to a protein of the present 
invention without diminishing its biological activity. Some modifications may be 
made to facilitate the cloning, expression, or incorporation of the targeting 

25 molecule into a fusion protein. Such modifications are well known to those of skill 
in the art and include, for example, a methionine added at the amino terminus to 
provide an initiation site, or additional amino acids (e.g., poly His) placed on either 
terminus to create conveniently located restriction sites or termination codons or 
purification sequences. 

30 Prokaryotic cells may be used as hosts for expression. Prokaryotes most 

frequently are represented by various strains of E. coli\ however, other microbial 
strains may also be used. Commonly used prokaryotic control sequences which 
are defined herein to include promoters for transcription initiation, optionally with 
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an operator, along with ribosome binding site sequences, include such commonly 
used promoters as the beta lactamase (penicillinase) and lactose (lac) promoter 
systems (Chang et ai (1977) Nature /9<?:1056), the tryptophan (trp) promoter 
system (Goeddel et ai (1980) Nucleic Acids Res. 5:4057) and the lambda-derived 
5 PL promoter and N-gene ribosome binding site (Shimatake et ai ( 1 98 1 ) Nature 
292: 128). The inclusion of selection markers in DNA vectors transfected in E. coli 
is also useful. Examples of such markers include genes specifying resistance to 
ampicillin, tetracycline, or chloramphenicol. 

The vector is selected to allow introduction into the appropriate host ceil. 

10 Bacterial vectors are typically of plasmid or phage origin. Appropriate bacterial 
cells are infected with phage vector panicles or transfected with naked phage 
vector DNA. If a plasmid vector is used, the bacterial cells are transfected with the 
plasmid vector DNA. Expression systems for expressing a protein of the present 
invention are available using Bacillus sp. and Salmonella (Palva et ai (1983) Gene 

15 22:229-235, Mosbachefa/. (1983) Nature 302.543-545). 

A variety of eukaryotic expression systems such as yeast, insect cell lines, 
plant and mammalian cells, are known to those of skill in the art. The sequences of 
the present invention can be expressed in these eukaryotic systems In some 
embodiments, transformed/transfected plant ceils are employed as expression 

20 systems for production of the proteins of the instant invention. 

Synthesis of heterologous proteins in yeast is well known. Sherman, F. et 
ai (1982) Methods in Yeast Genetics, Cold Spring Harbor Laboratory is a well 
recognized work describing the various methods available to produce the protein in 
yeast. Two widely utilized yeast for production of eukaryotic proteins are 

25 Saccharomyces cerevisia and Pichia pastoris Vectors, strains, and protocols for 
expression in Saccharomyces and Pichia are known in the art and available from 
commercial suppliers (e.g., Invitrogen). Suitable vectors usually have expression 
control sequences, such as promoters, including 3-phosphoglycerate kinase or 
alcohol oxidase, and an origin of replication, termination sequences and the like as 

30 desired. 

A protein of the present invention, once expressed, can be isolated from 
yeast by lysing the cells and applying standard protein isolation techniques to the 
lysates. The monitoring of the purification process can be accomplished by using 
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Western blot techniques or radioimmunoassay of other standard immunoassay 
techniques. 

The sequences encoding proteins of the present invention can also be 
ligated to various expression vectors for use in transfecting cell cultures of, for 

5 instance, mammalian, insect, or plant origin. Illustrative of cell cultures useful for 
the production of the peptides are mammalian cells. Mammalian cell systems 
often will be in the form of monolayers of cells although mammalian cell 
suspensions may also be used. A number of suitable host cell lines capable of 
expressing intact proteins have been developed in the an, and include the HEK293, 

10 BHK21, and CHO cell lines. Expression vectors for these cells can include 

expression control sequences, such as an origin of replication, a promoter (e.g., the 
CMV promoter, a HSV tk promoter or pgk (phosphoglycerate kinase promoter)), 
an enhancer (Queen ei al (1986) Immunol Rev. #9:49), and necessary processing 
information sites, such as ribosome binding sites, RNA splice sites, 

15 polyadenylation sites (e.g., an SV40 large T Ag poly A addition site), and 

transcriptional terminator sequences. Other animal cells useful for production of 
proteins of the present invention are available, for instance, from the American 
Type Culture Collection Catalogue of Cell Lines and Hybridomas (7th edition, 
1992). 

20 Appropriate vectors for expressing proteins of the present invention in 

insect cells are usually derived from the SF9 baculovirus. Suitable insect cell lines 
include mosquito larvae, silkworm, armyworm, moth and Drosophda cell lines 
such as a Schneider cell line (See Schneider et al. (1987)./ EmbryoL Exp. 
Morphol 27: 353-365). 

25 As with yeast, when higher animal or plant host cells are employed, 

polyadenylation or transcription terminator sequences are typically incorporated 
into the vector. An example of a terminator sequence is the polyadenylation 
sequence from the bovine growth hormone gene. Sequences for accurate splicing 
of the transcript may also be included. An example of a splicing sequence is the 

30 VP1 intron from SV40 (Sprague et al. (1983) J. Virol -/5:773-781). Additionally, 
gene sequences to control replication in the host cell may be incorporated into the 
vector such as those found in bovine papilloma virus-type vectors. Saveria- 
Campo, M., Bovine Papilloma Virus DNA a Eukaryotic Cloning Vector in DNA 
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Cloning Vol II a Practical Approach, DM. Glover, ed., IRL Press, Arlington, 
Virginia pp. 213-238 (1985). 

The sequences of the invention can be introduced into any plant of interest, 
and used for transformation of any plant species. The sequences to be introduced 
5 may be used in expression cassettes for expression in the particular plant of 
interest. 

Plants of interest include, but are not limited to corn (Zea mays), Brassica sp. 
(e.g., B. napus, B. rapa, B.juncea), particularly those Brassica species useful as 
sources of seed oil, alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale 

10 cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet (e.g., pear! millet 
(Pennisetum glaucum\ proso millet (Panicum miliaceum), foxtail millet (Setaria 
italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower 
(Carthamus tinctonus), wheat {Triticum aestivum), soybean (Glycine max), tobacco 
(Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), 

1 5 cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea 

batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), 
pineapple (Ananas comosus), citrus trees (Citrus spp ), cocoa (Theobroma cacao), tea 
(Camellia sinensis), banana (Musa spp ), avocado (Persea americana), fig (Ficus 
casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea\ 

20 papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia 
integrifolia), almond (Primus amy gdalus), sugar beets {Beta vulgaris), sugarcane 
(Saccharum spp ), oats, barley, vegetables, ornamentals, and conifers 

Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., 
Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis\ 

25 peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C 
sativus\ cantaloupe (C. cantalupensis), and musk melon (C meld). Ornamentals 
include azalea (Rhododendron spp ), hydrangea (Macrophylla hydrangea), hibiscus 
(Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa spp ), daffodils (Narcissus 
spp.), petunias (Petunia hybrida), carnation (Dianihus caryophyllus), poinsettia 

30 (Euphorbia pulcherrima), and chrysanthemum. Conifers that may be employed in 
practicing the present invention include, for example, pines such as loblolly pine 
(Pinus taeda\ slash pine (Pinus ellioiii), ponderosa pine (Pinus ponderosa), 
lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata), Douglas-fir 
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{Pseudotsuga menziesii), Western hemlock (Tsuga canadensis), Sitka spruce (Picea 
glauca), redwood {Sequoia sempervirens)\ true firs such as silver fir {Abies amabilis) 
and balsam fir {Abies balsamea); and cedars such as Western red cedar {Thuja 
plicaia) and Alaska yellow-cedar {Chamaecypahs nootkatensis) Preferably, plants 
5 of the present invention are crop plants (for example, corn, alfalfa, sunflower, 

Brassica, soybean, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, etc.), 
more preferably corn and soybean plants, yet more preferably corn plants. 

Plants of particular interest include grain plants that provide seeds of 
interest, oil-seed plants, and leguminous plants. Seeds of interest include grain 
10 seeds, such as corn, wheat, barley, rice, sorghum, rye, etc. Oil-seed plants include 
cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. 
Leguminous plants include beans and peas. Beans include guar, locust bean, 
fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, 
lentils, chickpea, etc. 

15 The RPA coding and antisense sequences of the invention are provided in 

expression cassettes for expression in the plant of interest. The cassette will 
include 5' and 3' regulatory sequences operably linked to a RPA sequence of the 
invention. The cassette may additionally contain at least one additional gene to be 
cotransformed into the organism. Alternatively, the additional gene(s) can be 

20 provided on another expression cassette. By "operably linked" is intended a 
functional linkage between a promoter and a second sequence, wherein the 
promoter sequence initiates and mediates transcription of the DNA sequence 
corresponding to the second sequence. Generally, operably linked means that the 
nucleic acid sequences being linked are contiguous and, where necessary to join 

25 two protein coding regions, contiguous and in the same reading frame. 

Such an expression cassette is provided with a plurality of restriction sites 
for insertion of the RPA sequence to be under the transcriptional regulation of the 
regulatory regions. The expression cassette may additionally contain selectable 
marker genes. 

30 The expression cassette will include in the 5-3' direction of transcription, a 

transcriptional and translational initiation region, a RPA DNA sequence of the 
invention, and a transcriptional and translational termination region functional in 
plants. The transcriptional initiation region, the promoter, may be native or 
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analogous or foreign or heterologous to the plant host. Additionally, the promoter 
may be the natural sequence or alternatively a synthetic sequence. By "foreign" is 
intended that the transcriptional initiation region is not found in the native plant 
into which the transcriptional initiation region is introduced As used herein, a 
5 chimeric gene comprises a coding sequence operably linked to a transcription 
initiation region that is heterologous to the coding sequence. 

While it may be preferable to express the sequences using heterologous 
promoters, the native promoter sequences may be used. Such constructs would 
change expression leveis of RPA in the plant or plant cell. Thus, the phenotype of 

10 the plant or plant cell is altered. 

The termination region may be native with the transcriptional initiation 
region, may be native with the operably linked DNA sequence of interest, or may 
be derived from another source. Convenient termination regions are available 
from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaiine 

15 synthase termination regions. See also Guerineau ei ai (1991) MoL Gen. Genet. 
262:141-144, Proudfoot (1991) Cell (54:671-674; Sanfacon ei ai (1991) Genes 
Dev. 5:141-149; Mogen el ai (1990) Plant Cell 2:1261-1272, Munroe el al. (1990) 
Gene 97:151-158; Ballas etal (1989) Nucleic Acids Res. 1 7: 789 1-7903, and Joshi 
etai (1987) Nucleic Acid Res. 75:9627-9639. 

20 Where appropriate, the gene(s) may be optimized for increased expression 

in the transformed plant. That is ? the genes can be synthesized using plant- 
preferred codons for improved expression. See, for example. Campbell and Gowri 
(1990) Plant Physiol. 92:1-1 1 for a discussion of host-preferred codon usage. 
Methods are available in the art for synthesizing plant-preferred genes. See, for 

25 example, U.S. Patent Nos. 5,380,831, and 5,436,391, and Murray el ai (1989) 
Nucleic Acids Res. 77:477-498, herein incorporated by reference. 

Additional sequence modifications are known to enhance gene expression 
in a cellular host. These include elimination of sequences encoding spurious 
polyadenylation signals, exon-intron splice site signals, transposon-like repeats, 

30 and other such well-characterized sequences that may be deleterious to gene 

expression. The G-C content of the sequence may be adjusted to levels average for 
a given cellular host, as calculated by reference to known genes expressed in the 

41 



WO 00/15816 



PCT/US99/21277 



host cell. When possible, the sequence is modified to avoid predicted hairpin 
secondary mRNA structures. 

The expression cassettes may additionally contain 5' leader sequences in 
the expression cassette construct. Such leader sequences can act to enhance 

5 translation. Translation leaders are known in the art and include: picornavirus 
leaders, for example, EMCV leader (Encep halo myocarditis 5' noncoding region) 
(Elroy-Stein et al (1989) PNAS USA 56:6126-6130); potyvirus leaders, for 
example, TEV leader (Tobacco Etch Virus) (Allison etal (1986), MDMV leader 
(Maize Dwarf Mosaic Virus), Virology 154.9-20), and human immunoglobulin 

10 heavy-chain binding protein (BiP), (Macejak et al. (1991) Nature 553:90-94); 
untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV 
RNA 4) (Jobling et al. (1987) Nature 525:622-625); tobacco mosaic virus leader 
(TMV) (Gallie et al (1989) in Molecular Biology of RNA, ed. Cech (Liss, New 
York), pp. 237-256); and maize chlorotic mottle virus leader (MCMV) (Lommel et 

15 al (1991) Virology 57:382-385). See also, Della-Cioppa et al (1987) Plant 
Physiol 54:965-968. Other methods known to enhance translation can also be 
utilized, for example, introns, and the like. 

In preparing the expression cassette, the various DNA fragments may be 
manipulated, so as to provide for the DNA sequences in the proper orientation and, 

20 as appropriate, in the proper reading frame. Toward this end, adapters or linkers 
may be employed to join the DNA fragments or other manipulations may be 
involved to provide for convenient restriction sites, removal of superfluous DNA, 
removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, 
primer repair, restriction, annealing, resubstitutions, e.g., transitions and 

25 transversions, may be involved. 

The sequences of the present invention can be used to transform or 
transfect any plant. In this manner, genetically modified plants, plant cells, plant 
tissue, seed, and the like can be obtained. Transformation protocols as well as 
protocols for introducing nucleotide sequences into plants may vary depending on 

30 the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. 
Suitable methods of introducing nucleotide sequences into plant cells and 
subsequent insertion into the plant genome include microinjection (Crossway et al. 
(1986) Biotechniques 4:320-334), electroporation (Riggs et al (1986) Proc. Natl 



WO 00/15816 



PCT/US99/21277 



Acad. Sci. USA 53:5602-5606, Agrobacterium-med'iaied transformation (Townsend 
et al, U.S. Pat No. 5,563,055), direct gene transfer (Paszkowski et al. (1984) 
EMBOJ. 3:2717-2722), and ballistic panicle acceleration (see, for example, 
Sanford et al, U.S. Patent No. 4,945,050; Tomes et al, U.S. Patent No. 5,879,918, 
5 Tomes et al, U.S. Patent No. 5,886,244; Bidney et al, U.S. Patent No. 5,932,782; 
Tomes et al (1995) "Direct DNA Transfer into Intact Plant Cells via 
Microprojectile Bombardment," in Plant Cell Tissue, and Organ Culture: 
Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin); and 
McCabe et al (1988) Biotechnology 6:923-926). Also see Weissinger et al (1988) 

10 Ann. Rev. Genet. 22:421-477, Sanford et al (1987) Particulate Science and 
Technology 5.27-37 (onion); Christou et al (1988) Plant Physiol 57:671-674 
(soybean), McCabe et al (1988) Bio/Technology 6:923-926 (soybean), Finer and 
McMullen ( 1 99 1 ) /// Vitro Cell Dev. Biol 21?: 1 75- 1 82 (soybean), Singh et al 
(1998) Theor. Appl Genet. 96 319-324 (soybean); Dattaer a/. (1990) 

15 Biotechnology 5:736-740 (rice), Klein et al (1988) Proc. Natl Acad. Sci. USA 
55:4305-4309 (maize), Klein et al (1988) Biotechnology 6:559-563 (maize); 
Tomes, U.S. Patent No. 5,240,855; Buising et al, U.S. Patent Nos. 5,322,783 and 
5,324,646, Tomes et al (1995) "Direct DNA Transfer into Intact Plant Cells via 
Microprojectile Bombardment," in Plant Cell Tissue, and Organ Culture: 

20 Fundamental Methods, ed. Gamborg (Springer-Verlag, Berlin) (maize); Klein el 
al (1988) Plant Physiol 97:440-444 (maize); Fromm et al (1990) Biotechnology 
5.833-839 (maize); Hooykaas-Van Slogteren et al (1984) Nature (London) 
311.163-164, Bowen et al, U.S. Patent No. 5,736,369 (cereals); Bytebier et al 
(1987) Proc. Natl. Acad Set USA 54:5345-5349 (Liliaceae); De Wet et al (1985) 

25 in The Experimental Manipulation of Ovule Tissues, ed. Chapman et al (Longman, 
New York), pp. 197-209 (pollen); Kaeppler et al (1990) Plant Cell Reports 9:415- 
418 and Kaeppler et al (1992) Theor. Appl. Genet. 54:560-566 (whisker-mediated 
transformation), D'Halluin et al (1992) Plant Cell 4: 1495-1505 (electroporation); 
Li et al (1993) Plant Cell Reports 72:250-255 and Christou and Ford (1995) 

30 Annals of Botany 75:407-4 1 3 (rice), Osjoda et al. ( 1 996) Nature Biotechnology 
74:745-750 (maize via Agrobacterium tumefaciens), all of which are herein 
incorporated by reference. 
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The cells that have been transformed may be grown into plants in 
accordance with conventional ways. See, for example, McCormick et ai (1986) 
Plant Cell Reports 5:81-84. These plants may then be grown, and either pollinated 
with the same transformed strain or different strains, and the resulting hybrid 
5 having constitutive expression of the desired phenotypic characteristic identified. 
Two or more generations may be grown to ensure that expression of the desired 
phenotypic characteristic is stably maintained and inherited and then seeds 
harvested to ensure expression of the desired phenotypic characteristic has been 
achieved. 

10 Transgenic plants expressing the selectable marker can be screened for 

transmission of the nucleic acid of the present invention by, for example, standard 
immunoblot and DNA detection techniques. Transgenic lines are also typically 
evaluated on levels of expression of the heterologous nucleic acid. Expression at 
the RNA level can be determined initially to identify and quantitate expression- 

15 positive plants. Standard techniques for RNA analysis can be employed and 
include PCR amplification assays using oligonucleotide primers designed to 
amplify only the heterologous RNA templates and solution hybridization assays 
using heterologous nucleic acid-specific probes. The RNA-positive plants can then 
be analyzed for protein expression by Western immunoblot analysis using the 

20 specifically reactive antibodies of the present invention. In addition, in situ 

hybridization and immunocytochemistry according to standard protocols can be 
done using heterologous nucleic acid specific polynucleotide probes and 
antibodies, respectively, to localize sites of expression within transgenic tissue. 
Generally, a number of transgenic lines are usually screened for the incorporated 

25 nucleic acid to identify and select plants with the most appropriate expression 
profiles. 

A preferred embodiment is a transgenic plant that is homozygous for the 
added heterologous nucleic acid; i.e., a transgenic plant that contains two added 
nucleic acid sequences, one gene at the same locus on each chromosome of a 
30 chromosome pair. A homozygous transgenic plant can be obtained by sexually 
mating (selfing) a heterozygous transgenic plant that contains a single added 
heterologous nucleic acid, germinating some of the seed produced and analyzing 
the resulting plants produced for altered RPA expression relative to a control plant 
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(i.e., native, non-transgenic). Backcrossing to a parental plant and out-crossing 
with a non-transgenic plant are also contemplated. 

The present invention further provides a method for modulating (i.e., 
increasing or decreasing) RPA levels in a plant or part thereof. Modulation can be 
5 effected by increasing or decreasing the total amount of RPA (i.e., its content) 

and/or the ratio of various RPA subunit proteins (i.e., its composition) in the plant. 
The method comprises transforming a plant cell with a recombinant expression 
cassette comprising a polynucleotide of the present invention as described above to 
obtain a transformed plant cell, growing the transformed plant cell under plant 

10 forming conditions, and inducing expression of a polynucleotide of the present 
invention in the plant for a time sufficient to modulate RPA content and/or 
composition in the plant or plant part. 

In some embodiments, RPA in a plant may be modulated by altering, in 
vivo or in vitro, the promoter of a non-isolated RPA gene to up- or down-regulate 

15 gene expression. In some embodiments, the coding regions of native RPA genes 
an be altered via substitution, addition, insertion, or deletion to decrease activity of 
the encoded enzyme. See, e.g., Kmiec, U.S. Patent 5,565,350, Zarling et al., 
PCTAJS93/03868. And in some embodiments, an isolated nucleic acid (e.g., a 
vector) comprising a promoter sequence is transfected into a plant cell. 

20 Subsequently, a plant cell comprising the promoter operably linked to a 

polynucleotide of the present invention is selected by means known to those of 
skill in the an such as, but not limited to, Southern blot, DNA sequencing, or PCR 
analysis using primers specific to the promoter and to the gene and detecting 
- amplicons produced therefrom. A plant or plant part altered or modified by the 

25 foregoing embodiments is grown under plant forming conditions for a time 

sufficient to modulate RPA content and/or composition in the plant. Plant forming 
conditions are well known in the art and discussed briefly, supra. 

In general, content or composition is increased or decreased by at least 5%, 
10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% relative to a native control 

30 plant, plant part, or cell lacking the aforementioned recombinant expression 

cassette. Modulation in the present invention may occur during and/or subsequent 
to growth of the plant to the desired stage of development. Modulating nucleic 
acid expression temporally and/or in particular tissues can be controlled by 
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employing the appropriate promoter operably linked to a polynucleotide of the 
present invention in, for example, sense or antisense orientation as discussed in 
greater detail, supra. Induction of expression of a polynucleotide of the present 
invention can also be controlled by exogenous administration of an effective 
5 amount of inducing compound. Inducible promoters and inducing compounds that 
activate expression from these promoters are well known in the art. In preferred 
embodiments, RPA is modulated in monocots, particularly maize. 

The ability of RPA to interact with multiple proteins or protein complexes 
allows it to participate and regulate these multiple pathways of DNA metabolism. 

10 For example, it has been shown in mammalian systems that are RPA interacts with 
DNA polymerase alpha (Banin et al. (1997) Biochemistry 36 .8443-8454), p53 
(Dutta et al. (1993) Nature 565:79-82), RAD 62 (Park et al. (1996) ./ Biol Chem. 
277:18996-19000). 

Participation of the middle subunit of RPA in protein-protein interactions 

15 has also been shown. Examples of such interactions include, but are not limited to 
interactions with XPA protein and RAD 52 (He et al. (1995) Nature 374 566-69; 
Matsuda et al. (1995) 1 Bioi Chem. 270.4152-57; Li et al. (1995) MoL Cell Biol 
75:5396-402, Park etal. (1996)7. Biol Chem. 277:18996-19000), and PCNA 
(Shivji etal. (1995) Biochemistry 34:501 1-5017). 

20 Similarly, yeast RPA has been shown to be involved in multiple functions 

in DNA metabolism (Umezu et al. (1998) Genetics /VS.989-1005). Therefore, the 
proteins of the invention may be useful as a ligand to purify and clone other 
proteins involved in DNA recombination, repair, and replication. Particularly, the 
maize proteins may be useful to purify other maize proteins involved in DNA 

25 metabolism. For example, the RPA proteins of the invention may be insolubilized 
on a solid matrix (e.g. agrose or nylon beads) for affinity purification, or the RPA 
cDNA may be used as a bait in a yeast to-hybrid system. In this manner, other 
proteins may be used identified and isolated. 

The following examples are offered by way of illustration and not by way 

30 of limitation. 
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EXPERIMENTAL 

Example 1: cDNA Cloning 

Total RNA was isolated from corn tissues with TRIzol Reagent (Life 
5 Technology, Inc. Gaithersburg, MD) using a modification of the guanidine 
isothiocyanate/acid-phenol procedure described by Chomozynski and Sacchi 
(Chomczynski et al. (1987) Anal Biochem. 762:156). In brief, plant tissue 
samples were pulverized in liquid nitrogen before the addition of the TRIzol 
Reagent, and then were further homogenized with a mortar and pestle. Addition of 

10 chloroform by centrifugation was conducted for separation of an aqueous phase 
and an organic phase. The total RNA was recovered by precipitation with 
isopropy! alcohol from the aqueous phase. 

The selection of poly(A)+RNA from total RNA was performed using 
PolyATract system (Promega Corporation, Madison, WI) In brief, biotinylated 

15 oligo (dT) primers were used to hybridize to the 3' poly(A) tails on mRNA. The 
hybrids were captured using streptavidin coupled to paramagnetic particles and a 
magnetic separation stand. The mRNA was washed at high stringent condition and 
cluted by Rnase-free deionized water. 

Synthesis of the cDNA was performed and unidirectional cDNA libraries 

20 were constructed using the Superscript Plasmid System (Life Technology, Inc., 
Gaithersburg, MD). First strand of CDNA was synthesized by priming an 
oligo(dT) primer containing a Not 1 site. The reaction was catalyzed by 
Superscript Reverse Transcriptase II at 45°C. The second strand of cDNA was 
labeled with a- 32 P-dCTP and portions of the molecules smaller than 500 base pairs 

25 and unligated adapters were removed by Sephacryl-S400 chromatography. The 
selected cDNA molecules were ligated into pSPORTl reference vector between 
the Not I and Sal I sites. 

Individual colonies were picked and DNA was prepared either by PCR with 
Ml 3 forward primers and Ml 3 reverse primers, or by plasmid miniprep isolation. 

30 All the cDNA clones were sequenced using Ml 3 reverse primers. 

Two maize homologues for RPA large subunit (ZmRPALSH) have been 
isolated. The genes map to two different chromosomes as shown below in Table 1. 
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The amino acid and nucleotide sequences for the two homologues are set forth in 
SEQIDNOs: 1-4. 

5 

Table 1 

Maize RPA Large Subunit Genes Map to Two Different Chromosomes 



Clone ID 


Chromosome No. 


Homologue 


CBPBS68 


c9 


ZmRPALSHl 


CCRBJ83 


c9 


ZmRPALSHl 


CDPGS47 


c9 


ZmRP.ALSHl 


CHCLE65 


c9 


ZmRPALSHl 


CJLPL35 


c9 


ZmRPALSHl 


COMGE67 


c9 


ZmRPALSHl 


CBAAK06 


c9 


ZmRPALSH2 


CDPGS46 


c9 


ZmRPALSH2 


CERAG93 


c9 


ZmRPALSH2 


COMFY67 


c9 


ZmRPALSH2 



Ten ESTs, which form two different contigs for maize RPA large subunit, were 
used as probes for mapping experiments Each contig represents one maize 
homologue for RPALS. 

10 Seven maize homologues for RPA middle subunit (ZmRPAMSH) have 

been isolated. The genes map to chromosomes 5 as shown below in Table 2. The 
nucleotide and amino acid sequences of the seven homologues are set forth in SEQ 
IDNOs: 11-22. 
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Table 2 

Maize Homologues of Eukaryotic Replication Protein A Middle Subunit 



Clone ID 


Homologue 


Library 


Map 
Position 


CCRBK.63 


ZmRPAMSH-1 


P0026 


C5 


CGEUZ26 


ZmRPAMSH-2 


P0002 


TBD 


CGEVJ74 


ZmRPAMSH-3 


P0002 


TBD 


CHSBX01 


ZmRPABMS-4 


P0118 


C5 


CIMME04 


ZmRPAMSH-5 


P0114 


C5 


CRTBB78 


ZmRPAMSH-6 j P0041 


C5 


CVRAP89 


ZmRPAMSH-7 


P0057 


C5 



5 TBD = To be determined. 

Example 2: Transformation and Regeneration of Trnsgenic Plants: 

Immature maize embryos from greenhouse donor plants are bombarded 
with a plasmid containing the RPA antisense sequence of the invention operably 
10 linked to a pathogen-inducible promoter (Figure 2) plus a plasmid containing the 
selectable marker gene PAT (Wohlleben et al. (1988) Gene 70:25-37) that confers 
resistance to the herbicide Bialaphos. Transformation is performed as follows. All 
media recipes are in the Appendix. 

1 5 Preparation of Target Tissue 

The ears are surface sterilized in 30% Chlorox bleach plus 0.5% Micro 
detergent for 20 minutes, and rinsed two times with sterile water. The immature 
embryos are excised and placed embryo axis side down (scutellum side up), 25 
embryos per plate, on 560 Y medium for 4 hours and then aligned within the 2.5- 

20 cm target zone in preparation for bombardment. 
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Preparation of DNA 

A plasmid vector comprising the RPA sequence of the invention operafaly 
linked to a ubiquitin promoter is made. This plasmid DNA plus plasmid DNA 
containing a PAT selectable marker is precipitated onto 1 1 jim (average diameter) 
5 tungsten pellets using a CaCh precipitation procedure as follows: 



100 \i\ prepared tungsten particles in water 
10 jlxI (1 |ig) DNA in TrisEDTA buffer (1 jig total) 
100 nl2.5MCaCl 2 
10 10 |il 0.1 M spermidine 



Each reagent is added sequentially to the tungsten particle suspension, 
while maintained on the multitube vortexer. The final mixture is sonicated briefly 
and allowed to incubate under constant vortexing for 10 minutes. After the 

15 precipitation period, the tubes are centrifuged briefly, liquid removed, washed with 
500 ml 100% ethanol, and centrifuged for 30 seconds. Again the liquid is 
removed, and 105 |il 100% ethanol is added to the final tungsten particle pellet. 
For particle gun bombardment, the tungsten/DNA particles are briefly sonicated 
and 10 |il spotted onto the center of each macrocarrier and allowed to dry about 2 

20 minutes before bombardment. 



Particle Gun Treatment 

The sample plates are bombarded at level #4 in particle gun #HE34-1 or 
#HE34-2. All samples receive a single shot at 650 PSI, with a total often aliquots 
25 taken from each tube of prepared particles/DNA. 

Subsequent Treatment 

Following bombardment, the embryos are kept on 560Y medium for 2 
days, then transferred to 560R selection medium containing 3 mg/liter Bialaphos, 
30 and subcultured every 2 weeks. After approximately 10 weeks of selection, 

selection-resistant callus clones are transferred to 288J medium to initiate plant 
regeneration. Following somatic embryo maturation (2-4 weeks), well-developed 
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somatic embryos are transferred to medium for germination and transferred to the 
lighted culture room. Approximately 7-10 days later, developing plantlets are 
transferred to 272V hormone-free medium in tubes for 7-10 days until plantlets are 
well established. Plants are then transferred to inserts in flats (equivalent to 2.5" 
5 pot) containing potting soil and grown for 1 week in a growth chamber, 

subsequently grown an additional 1-2 weeks in the greenhouse, then transferred to 
classic 600 pots (1.6 gallon) and grown to maturity. Plants are monitored and 
scored for expression of the RPA gene of interest. 
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APPENDIX 
272 V 



Ingredient 


Amount 


Unit 


D-I H 2 0 


950.000 


Ml 


MS Salts (GIBCO 11117-074) 


4.300 


G 


Myo-Inositoi 


0.100 


G 


MS Vitamins Stock Solution ## 


5.000 


Ml 


Sucrose 


40.000 


G 


Bacto-Agar @ 


6.000 


G 



Directions: 
5 @ = Add after bringing up to volume 

Dissolve ingredients in polished D-I H 2 0 in sequence 
Adjust to pH 5.6 

Bring up to volume with polished D-I H 2 0 after adjusting pH 
Sterilize and cool to 60°C. 

10 ## = Dissolve 0. 1 00 g of Nicotinic Acid, 0.020 g of Thiamine.HCL; 0. 100 g of 
Pyridoxine.HCL; and 0.400 g of Glycine in 875 .00 ml of polished D-I H 2 0 in 
sequence. Bring up to volume with polished D-I H 2 0 Make in 400 ml portions. 
Thiamine.HCL & Pyridoxine.HCL are in Dark Desiccator. Store for one month, 
unless contamination or precipitation occurs, then make fresh stock. 

15 Total Volume (L) = 1.00 
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288 J 



Ingredient 


Amount 


Unit 


D-I H 2 0 


950.000 


Ml 


MS Salts 


4.300 


G 


Myoinositol 


0.100 


G 


MS Vitamins Stock Solution ## 


5.000 


Ml 


Zeatin .5 mg/ml 


1.000 


Ml 


Sucrose 


60.000 


G 


Gelrite @ 


3.000 


G 


Indoleacetic Acid 0 5 mg/ml # 


2.000 


Ml 


0.1 mM Abscisic Acid 


1.000 


Ml 


Bialaphos 1 mg/ml # 


3.000 


Ml 



Directions: 

@ = Add after bringing up to volume 
5 Dissolve ingredients in polished D-I H2O in sequence 
Adjust to pH 5.6 

Bring up to volume with polished D-I H 2 0 after adjusting pH 

Sterilize and cool to 60°C. 

Add 3.5g/L of Gelrite for cell biology. 
10 ## = Dissolve 0. 100 g of Nicotinic Acid, 0.020 g of Thiamine.HCL; 0.100 g of 

Pyridoxine.HCL; and 0 400 g of Glycine in 875 .00 ml of polished D-I H 2 0 in 

sequence Bring up to volume with polished D-I H 2 0. Make in 400 ml portions. 

Thiamine.HCL & Pyridoxine.HCL are in Dark Desiccator. Store for one month, 

unless contamination or precipitation occurs, then make fresh stock. 
15 Total Volume (L) = 1.00 
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560 R 



Ingredient 


Amount 


Unit 


D-I Water, Filtered 


950.000 


Ml 


CHU (N6) Basal Salts (SIGMA C-1416) 


4.000 


G 


Eriksson's Vitamin Mix (1000X SIGMA- 151 1) 


1.000 


Ml 


Thiamine. HCL 0.4mg/ml 


1.250 


Ml 


Sucrose 


30.000 


G 


2, 4-D0.5mg/ml 


4.000 


Ml 


Gelrite @ 


3.000 


G 


Silver Nitrate 2mg/ml # 


0.425 


Ml | 


Bialaphos 1 mg/ml # 


3.000 


Ml 



Directions: 
5 @ = Add after bringing up to volume 

# = Add after sterilizing and cooling to temp. 

Dissolve ingredients in D-I H 2 0 in sequence 

Adjust to pH 5.8 with KOH 

Bring up to volume with D-I H 2 0 
1 0 Sterilize and cool to room temp. 

Total Volume (L) = 1.00 
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560 Y 



Ingredient 


Amount 


Unit 


D-I Water, Filtered 


950.000 


Ml 


CHU (N6) Basal Salts (SIGMA C-1416) 


4.000 


G 


Eriksson's Vitamin Mix (1000X SIGMA-151 1) 


1.000 


Ml 


Thiamine. HCL 0.4mg/ml 


1.250 


Ml 


Sucrose 


120.000 


G 


2,4-D 0.5mg/ml 


2.000 


MI 


L-Proline 


2.880 


G 


Gelrite @. 


2.000 


G 


Silver Nitrate 2mg/ml # 


4.250 


Ml 



Directions: 

@ = Add after bringing up to volume 
5 # = Add after sterilizing and cooling to temp. 

Dissolve ingredients in D-I H2O in sequence 

Adjust to pH 5. 8 with KOH 

Bring up to volume with D-I H2O 

Sterilize and cool to room temp. 
10 ** Autoclave less time because of increased sucrose** 

Total Volume (L) = 1.00 

All publications and patent applications mentioned in the specification are 

indicative of the level of those skilled in the art to which this invention pertains. All 

publications and patent applications are herein incorporated by reference to the same 
15 extent as if each individual publication or patent application was specifically and 

individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of 

illustration and example for purposes of clarity of understanding, it will be obvious that 

certain changes and modifications may be practiced within the scope of the appended 
20 claims. 
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INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 



(PCT Rule \3bis) 



A The indications made beiow relate to the deposited microorganism or other biological material referred to in the description on page 5, lines 5, 
8 and 13 



B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet □ 


Name of depository institution 

American Type Culture Collection 


Address of depositary institution (including postal code and country) 




10801 University Blvd. 
Manassas, VA 20110-2209 
USA 




Date of deposit 

21 August 1998 (21.08.98) 


Accession Number 

98843 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) 


This information is continued on an additional sheet □ 



Accession No. 98754 - page 5, lines 5, 8 and 13 - Date of deposit : 26 May 1998 (26.05.98) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicators are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications e.g., 'Accession 
Number of Deposit") 



For receiving Office use only 



O This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



□ This sheet was received with the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1998) 
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THAT WHICH IS CLAIMED: 

1 . An isolated protein having the amino acid sequence selected from 
the group consisting of. 

a) an amino acid sequence of a maize replication protein A 

5 large subunit, 

b) an amino acid an amino acid sequence of a plant replication 
protein A middle subunit; 

c) an amino acid sequence set forth in SEQ ID NO 2, SEQ ID 
NO 4, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO 16, SEQ ID NO 18, SEQ 

10 ID NO: 20, and SEQ ID NO: 22; 

d) an amino acid sequence having substantial identity to an 
amino acid sequence of a), b), or c); 

e) an amino acid sequence comprising at least 20 contiguous 
residues of an amino acid sequence of a), b, or c); 

15 f) a variant of an amino acid sequence of a), b, or c). 

2. An isolated nucleotide sequence selected from the group consisting 

of: 

a) a nucleotide sequence encoding a maize replication protein 
20 A (RPA) large subunit; 

b) a nucleotide sequence encoding a plant replication protein A 
(RPA) middle subunit; 

c) a nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID 
NO 3, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO 15, SEQ ID NO: 17, SEQ 

25 ID NO: 19, and SEQ ID NO: 21; 

d) a nucleotide sequence comprising at least 20 contiguous 
nucleotides of a nucleotide sequence of a) b), or c); 

e) an antisense nucleotide sequence corresponding to a 
nucleotide sequence of a), b), or c); 

30 f) a nucleotide sequence that hybridizes to the nucleotide 

sequences of a), b), or c) under stringent conditions; and 
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g) a nucleotide sequence that encodes an amino acid sequence 
according to claim 1. 

3. A DNA construct comprising a nucleotide sequence according to 
5 claim 2 wherein said nucleotide sequence is operabiy linked to a promoter that 

drives expression in a plant cell. 

4. The DNA construct of claim 3, wherein said promoter is a tissue- 
preferred promoter. 

10 

5. The DNA construct of claim 4, wherein said promoter is a 
pathogen-inducible promoter. 

6. The DNA construct of claim 5, wherein said nucleotide sequence is 
15 an antisense sequence. 

7. The DNA construct of claim 3, wherein said promoter is a 
constitutive promoter. 

20 8. A method for enhancing homologous recombination in a plant cell, 

said method comprising transforming said plant cell with at least one nucleotide 
sequence operabiy linked to a heterologous promoter that drives expression in a 
plant cell, said nucleotide sequence selected from the group consisting of: 

a) a nucleotide sequence encoding a maize replication protein 
25 A (RPA) large subunit; 

b) a nucleotide sequence encoding a plant replication protein A 
(RPA) middle subunit; 

c) a nucleotide sequence set forth in SEQ ID NO; 1, SEQ ID 
NO: 3, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 

30 ID NO: 19, and SEQ ID NO: 21, 

d) a nucleotide sequence comprising at least 20 contiguous 
nucleotides of a nucleotide sequence of a) b), or c), 
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e) a nucleotide sequence that hybridizes to the nucleotide 
sequences of a), b), or c) under stringent conditions; and 

f) a nucleotide sequence that encodes an amino acid sequence 
according to claim 1. 

5 

9 The method of claim 8, wherein said promoter is a constitutive 
promoter 

10. The method of claim 9, wherein said promoter is an ubiquitin 
10 promoter. 

11. A method for increasing pathogen resistance in a plant cell, method 
comprising transforming said plant cell with at least one nucleotide sequence 
operably linked to a pathogen-inducible promoter said nucleotide sequence 

1 5 selected from the group consisting of: 

a) an antisense nucleotide sequence corresponding to a maize 
replication protein A large subunit, and 

b) an antisense nucleotide sequence corresponding to a plant 
replication protein A middle subunit. 

20 

12. A transformed plant cell having stably incorporated into its genome 
at least one nucleotide sequence, said nucleotide sequence operably linked to a 
heterologous promoter that drives expression in a plant cell, wherein said 
nucleotide sequence is selected from the group consisting of: 

25 a) a nucleotide sequence encoding a maize replication protein 

A (RPA) large subunit; 

b) a nucleotide sequence encoding a plant replication protein A 
(RPA) middle subunit; 

c) a nucleotide sequence set forth in SEQ ED NO: 1, SEQ ID 
30 NO: 3, SEQ ID NO: 1 1, SEQ ID NO 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 

ED NO: 19, and SEQ ID NO: 21; 

d) a nucleotide sequence comprising at least 20 contiguous 
nucleotides of a nucleotide sequence of a) b), or c); 
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e) an antisense nucleotide sequence corresponding to a 
nucleotide sequence of a), b), or c); 

f) a nucleotide sequence that hybridizes to the nucleotide 
sequences of a), b), or c) under stringent conditions; and 

5 g) a nucleotide sequence that encodes an amino acid sequence 

according to claim 1. 

13. A transformed plant having stably incorporated into its 
genome at least one nucleotide sequence, said nucleotide sequence operably linked 
10 to a heterologous promoter that drives expression in a plant cell, wherein said 
nucleotide sequence is selected from the group consisting of: 

a) a nucleotide sequence encoding a maize replication protein 
A (RPA) large subunit, 

b) a nucleotide sequence encoding a plant replication protein A 
15 (RPA) middle subunit; 

c) a nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 19, and SEQ ID NO: 21; 

d) a nucleotide sequence comprising at least 20 contiguous 
20 nucleotides of a nucleotide sequence of a) b), or c); 

e) an antisense nucleotide sequence corresponding to a 
nucleotide sequence of a), b), or c); 

0 a nucleotide sequence that hybridizes to the nucleotide 
sequences of a), b), or c) under stringent conditions; and 
25 g) a nucleotide sequence that encodes an amino acid sequence 

according to claim 1 . 

14. Seed of the plant of claim 13 . 

30 15. The plant claim 13, wherein said plant is a monocot. 

16. The plant of claim 15, wherein said monocot is maize, wheat, rice, 
barley, sorghum, or rye. 
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17. The plant of claim 13, wherein said plant is a dicot. 

18. The plant of claim 17, wherein said dicot is selected from the group 
5 consisting of soybean, canola, sunflower, alfalfa, or safflower. 

19. Seed of the plant of claim 17. 

20. A method for modulating DNA metabolism in a plant cell, said 
10 method comprising transforming said plant cell with at least one nucleotide 

sequence operably linked to a promoter wherein said nucleotide sequence is 

selected from the group consisting of: 

a) a nucleotide sequence encoding a maize replication protein 

A (RPA) large subunit; 
15 b) a nucleotide sequence encoding a plant replication protein A 

(RPA) middle subunit; 

c) a nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID 

NO: 3, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 

ID NO: 19, and SEQ ID NO: 21; 
20 d) a nucleotide sequence comprising at least 20 contiguous 

nucleotides of a nucleotide sequence of a) b), or c); 

e) an antisense nucleotide sequence corresponding to a 
nucleotide sequence of a), b), or c); 

f) a nucleotide sequence that hybridizes to the nucleotide 
25 sequences of a), b), or c) under stringent conditions, and 

g) a nucleotide sequence that encodes an amino acid sequence 
according to claim 1 . 

21. A method for influencing cell cycle in a plant cell, said method 
30 comprising transforming said plant cell with at least one nucleotide sequence 

operably linked to a promoter wherein said nucleotide sequence is selected from 
the group consisting of: 
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a) a nucleotide sequence encoding a maize replication protein 
A (RPA) large subunit, 

b) a nucleotide sequence encoding a plant replication protein A 
(RPA) middle subunit; 

c) a nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 19, and SEQ ID NO: 21; 

d) a nucleotide sequence comprising at least 20 contiguous 
nucleotides of a nucleotide sequence of a) b), or c); 

e) an antisense nucleotide sequence corresponding to a 
nucleotide sequence of a), b), or c); 

f) a nucleotide sequence that hybridizes to the nucleotide 
sequences of a), b), or c) under stringent conditions; and 

g) a nucleotide sequence that encodes an amino acid sequence 
according to claim 1. 
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1/4 



ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
RfQl Hunan 
Rfa l.Drone 
Rf a I Schpo 
Rfa I "Yeast 



ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfa I.Hunan 
Rfa l.Drone 
Rfa I Schpo 
Rfal'Yeast 



ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfa I Hunan 
Rfa I .Drone 
Rfa I Schpo 
Rfa T Yeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfa I Hunan 
Rfa I "Drone 
Rfa I Schpo 
RfaTYeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfa I Xenla 
Rfa I Hunan 
Rfa I .Drone 
Rfa I Schpo 
Rfal'Yeast 



1 

— MDAAKSVT 
— MDAAKLVT 
MDSDAAPSVT 
— MALPQLS 

MVGQLS 

— MVLASLS 

MAERLS 

— MSSVQLS 

51 

FSFTASDGND 
FSFTATDGKD 
FTFLASDGKD 
YRLLMSDGLN 
YRLLMSDGLN 
YRILISDGKY 
YRVVLSDSIN 
NLIMISDGIY 

101 

NADKVLIVVK 
KDDKVLVVIK 
KQEKVLIITK 
.GRRVIIVME 
.GRRVVILME 
AGKRVLIISE 
. .RKILIVLG 
RKKYVLLVDD 

151 

ETNSPP..L. 
. . .TSP..L. 
PTNAPP. .LP 
. . PAPSKLQ 
SSRPQPQNGS 
IAKKEPSHNN 
TGISTSTNSF 
VA. . . .NQTN 

201 

TLNPYQGNWV 
TLNPYQGNWV 
SLNPYQGNWI 
SLNPYQSKVT 
SLTPYQSKWT 
SLSPYQNKWV 
GLSPYQNKWT 
QLSPYQNVWT 



PGAVSYIL.. 
PVAVSHIL.. 
PGAVAFVLEN 
EGAISA-MLG 
EGAIAAIMQK 
TGVIARIM.H 
VGALRI INTS 
RGDFHSIFTN 



KIKA.MLPTY 
KIKA.MLPTN 
KIKT.MLLTQ 
TLSSFMLATQ 
TLSSFMLATQ 
FNSYAMLASQ 
YAQS.MLSTQ 
HMKA.LLRNQ 



AHPSTGSDGA 
AHPSAGSDGA 
ASPDAATGVP 
GDSSC. . KPT 
GDTNI. .KPI 
GEVVD. . APV 
DASSFPPNPI 
KQR. .YDNPT 



FASEVHSGNL 
FGSEVRSGNL 
LAPEVRSGNI 
LNSLVDNNLL 
LNPLVEEEQL 
LNVMQHNGEL 
LNHLVAENKL 
AASKFQSMEL 



VSDLVVQVLD 
VTDLVVQVLD 
VPEIVLQVVD 
LQVINIRPIN 
LQVINIRPIT 
LQILAIKKIN 
LQVLTVKELN 
GGVYQVYNTR 



KNFGLIRILD 
KNLGLIRIID 
QNLGVIRVLD 
ATNCICQVSR 
SSNCVCQIHR 
EEFTIVQLDK 
QKGAGVQLTQ 
QRGDIIRV. . 



CETVCEA . . L 
CELVCQA . . L 
LEVVFKA . . L 
LDVLKSADLV 
LEVLKSAEAV 
LTVVNPGAEV 
LNVLTELG.V 
FELVQSRADM 

. .VMKPKQEV 
. .VMKPKQEV 
PVVLKPKQEV 
NNSAPPPSMN 
SGMGSTVSKA 

NN 

YGNNAAATAP 
ASNAGVPDML 



DAEINGEAKK 
DAEINGEAKK 
DSEIKCEAEK 
MGKIGNPQPY 
GVKIGNPVPY 
KSKIGEPVTY 
MDKIGNPAGL 
VNQTSTFLDN 

KSASQIVTEQ 
KSASQIVTEQ 
KSASQIVNEQ 
RGTSKLFG . . 
YGASKTFGKA 



IKVRVTSKGN 
IKVRVTSKGN 
IKVRVTSKGN 
VRARVTNKGQ 
ICARVTNKSQ 
IKARVTSKSG 
IRARVTNKSE 
IKARVSYKGE 



APPPMMKKPA 
HSNSNLNANE 

LRTYRNARGE 
LRTYRNARGE 
LRTYKNARGE 
IRTWSNSRGE 
IRTWSNSRGE 
IRTWSNARGE 
VKHWHNQRGE 
IKTVHNGRGD 



ED. .PPIVLK 

EE. .PPIVLK 
QEEKPAILLS 
ND . . GGPQPA 
NEGLGQPQVA 
ENAAKQDLAP 
ETVDALRQQQ 
YFSEHPNETL 

RGNAAPATRL 
RGNAAPATRL 
RGNAAPAARL 
.GGSLLNTPG 
AGPSLSHTSG 
. . .NIVMNSS 

APNSL 

RKFANENPNS 

GCVFNVELTD 
GCVFNVELTD 
GCVFNVELTD 
GKLFSIEMVD 
GKLFSLELVD 
GKLFSMDLMD 
GKLFSVNLLD 
GKLFNVNFLD 



50 

LKSIGMGS. R 
LKSVGTGS.R 
LKPIGT. . .R 
. . . TGNGPPR 
. . . TGNSPPR 
. . . SAADSER 
SNPTSGAPKR 
KSDGANSNRK 

100 

YTCNSVK..G 
YTCNVVK. .G 
YTCNTIG. .E 
FIVNNL.KD. 
FIVNTL.KD. 
YVTSLVGKDG 
FTVNVMKE. . 
IIAEPAIVRE 

150 

PKDEGSVVAE 
PKDECVGV. . 
PKEESVVLSK 
APAPASAPA . 
PPAPAASPAA 
KPAVTSNSKP 
NEQNNASAPR 
KDEDITDSGN 

200 

SMTRRVHPLI 
SMTRRVHPLI 
AMTRRVHPLI 
GSQSKVVPIA 
GTQSKVVPIA 
INSGMTHPIS 
..STIIYPIE 
QKTRPIFAIE 

250 

EDGTQIQATM 
EDGTQIQATM 
VDGTQIQATM 
ESG EIRATA 
ESG EIRATA 
ESG EIRATA 
ESG EIRATG 
TSG EIRATA 



TO FIG. 1B. 
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FROM FIG. 1A. 



ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfal.Hunan 
RfQ l.Drone 
Rfal Schpo 
Rfa I .Yeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
RfQ I .Hunan 
Rfa l.Drone 
Rfal.Schpo 
Rfal .Yeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
RfaLXenla 
Rfal Hunan 
Rfal.Drone 
Rfal Schpo 
RfaTYeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal Xenla 
Rfal.Hunan 
Rfal.Drone 
Rfal Schpo 
RfaTYeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfal.Hunan 
Rfal.Drone 
Rfa I. Schpo 
Rfal.Yeast 



251 

FNEAAKKFYP 
FNDAAKKFYP 
FNEAAKKFYP 
FNEQADKFFS 
FNEQVDKFFP 
FKEQCDKFYD 
FNDQVDAFYD 
FNDF ATKFNE 

301 

EAE..GETFL 
EAE..GETCI 
EAE..GETFI 
PCDDSAD . . V 
PCEDDH..L 
LCEDTDDDPI 
KAED . . QTAV 
ECFDESN..V 

351 

KIDNETIPKR 
KIDNETIPKR 
KIDNETIPKR 
KSNNREVSKR 
RSNNREVAKR 
RTTNKEFKKR 
RATSRGFDKR 
RA.GKKFDRR 

401 

KSLKVSDFQ. 
KSLKVSDFQ. 
KSLKVSDFQ. 
KGARLSDF. G 
KGARVSDF. G 
KGTRINEFNG 
KGVKVNDFQ . 
KGVRVTDF. G 



IFELGKVYYV 
IFELGKVYYV 
MFELGKVYYI 
I IEVNKVYYF 
LIEVNKVYYF 
LIQVDSVYYI 
ILQEGSVYYI 
ILQEGKVYYV 

PPVQYNLVKI 
PQVQYNLVKI 
PQIQYNFVKI 
PMVQFEFVSI 
PTVQFDFTGI 
PEIKYNLVPI 
PVAKFSFVSL 
PKTHFNFIKL 

DIVVADDSGK 
DIVVADDSGK 
DIVVADDSSK 
SIHLMDSSGK 
NIYLMDTSGK 
DITLVDMSNS 
DITIVDQTGY 
DITIVDDSGF 

GVSLSTIGRS 
GVSLSTVGKS 
GLSLSTVGRS 
GRSLSVLSSS 
GRSLSVLESSS 
GKSLSLGGGS 
GRSLSMLTSS 
GKSLSMGFSS 



SKGSLRIANK 
SKGSLRIANK 
SKGSLRVANK 
SKGTLKIANK 
SKGTLKIANK 
SKCQLKPANK 
SRCRVNIAKK 
SKAKLQPAKP 

DQLGPYVGGR 
DQLGSYVGGR 
DQLGPYVGGR 
GELES. KNKD 
DDLEN. KSKD 
SDVSG.MENK 
QEVGD.VAKD 
DAIQN.QEVN 

TVTISLWNDL 
TVSISLVNDL 
TVTISLVNDL 
VVSTTLWGED 
VVTATLWGED 
AISLTLWGDD 
EMRVTLWGKT 
SISVGLWNQQ 

TLEINPDLPE 
TLAINPDLHE 
TIVVNPDLPE 
TVMINPDIPE 
TIIANPDIPE 
IMKINPDIPE 
TMSVDPDIQE 
TLIPNPEIPE 



300 

LSLNENAIVE 
MSLNENAIVE 
MTLNENAVVE 
MTFNSETSVI 
MTFNNETSVM 
MTFSGETVVQ 
LMFERDTEIR 
LNLDRDTVIE 

350 

SVSPTLSVRR 
SVSPTLSVRR 
SVSPTLSVRR 
NVEEVTKVTI 
SYEDATKITV 
EVGELQSFVA 
NVGPVQQITS 
TINPHFELTS 

400 

ATTTGQELLD MVDSSPVVAI 
ATTTGQELLD MADSSPVVAI 
ATTTGQELLD MVDSAPIIAI 

ADKFD GSRDPVVAI 

ADKFD GSRQPVLAI 

AVNFD GHVQPVILV 

AIEFS VSEESILAF 

ALDFN LPEGSVAAI 



QFKTVKNDYE 
QFKTVQNDYE 
QFKTVHNDYE 
QYTSVKNDYE 
QFTAVKNDYE 
QYSSLNNAYE 
QYTNVQNEYE 
QFTNLTHPYE 

ELVDIVGVVQ 
ELVDIVGVVQ 
ELVDVIGVVQ 
TVLDIIGVCK 
SLVDIIGICK 
AAVDTIGICK 
AVIDVIGVLQ 
SNVDVLGIIQ 



451 

SAEAGATRAG G . . FKSMYSD RVFLSHITSD 
GAEMGAARAG G. . FKSTYSD RVFLSHITSD 
GSDMGASRVG G. . ARSMYSD RVFLSHITSD 

SESRGG.GTG GGN TN WKSLLEVKNE 

SDLKSG.GVG GSN TN WKTLYEVKSE 

SARTGG. . . G SFS TE WMTLKDARAR 

SVISSTLSTT GRS AE RKNIAEVQAE 

KQEPGMGGQS AASLTKFIAQ RITIARAQAE 
- T0Raia 



AKNLKSWYDS 
AQNLKSWYDS 
AEQLRAWYDS 
AFKLRAWFDS 
AYKLRGWFDA 
AHKLRGWFDN 
SHLLDGWYDG 
AYALKGWYDS 

PAMGQEKPVF 
PAMGQEKPVF 
PNLGQDKPVF 
NLGHGEKADY 
NLGQGDKPDY 
NLGSGDKPDY 
HLGMSETPDY 
NLGRSEKGDF 



450 

EGKDTSLAPI 
EGKDTSLAPI 
EGKGTSMASI 
EGQVVEGTSI 
EGQALDGVSI 
GGGDSVANMV 
QGRGQEFAKH 
KGRNANFITL 

500 

FSLYAIISHI 
FSLYATISHI 
FSLNAYISLI 
FTSVATIVYL 
FSSVATVVYL 
FQCKAVVHIV 
FSLKGTIVYI 
FSVKAAISFL 
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FROM FIG. 1B. 



ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rf q I Hunan 
RfQ I .Drone 
Rfal Schpo 
RfaTYeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfo I Hunan 
Rf a I .Drone 
Rfal Schpo 
RfaTYeast 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfal Hunan 
Rfal .Drone 
Rfal Schoo 
RfaTYeasx 

ZMRPALSH1 
ZMRPALSH2 
024183 
Rfal.Xenla 
Rfal Hunan 
Rfal Drone 
Rfal Schpo 
Rfa I .Yeast 



501 

KPDQNMVYRA 
KPDQNMWYRA 
KPDQTMWYRA 
RKE.NCLYQA 
REK.NCMYQA 
KQE.NAFYRA 
RKK.NVSYPA 
KVD. NFAYPA 

551 

SDPTGEAWVS 
SDPTGEAWFS 
SDPTGEAWLS 
ADFGENQWIT 
ADFQENQVVT 
GDWTSNRWVS 
GDHTGQLVLN 
IDETNQLWLT 

601 

HLFRVSVTQH 
HLFRVSVTQH 
HLFRVSVTQN 
YTFRARVKLE 
FIFRVRVKVE 
HIFKLRCKNE 
YIFQCRAKQD 
YDFRIRARED 



651 



377. VNKXV 

w. .mm 

WT.WKKV 
QPSQWNKKV 
TTQDCNKKV 
T0SDONKKV 
TAAimKKV 
WE1&KKV 

VFNEHAEKII 
VFNEHAEKII 
LFNDQAERIV 
CFQESAISIL 
CFQESAEAIL 
SFNEVGEQLL 
VFDDVGKLIM 
LFDDQAKQLL 



EYMNEKRQRI 
EYNNEKRQRI 
EYMNEKRQRI 
TYNDESRIKA 
TYNDESRIKA 
VYGDMTRNKL 
NFKGENRVRY 
TYNDQSRIRY 



TEAFGSGYWE 
TETFGSGYK 
TEAMGSGYK 
IDOONGLFRC 
IDCMGLYRC 
VDEGNDQFRC 
FDQG. GSWRC 
LEQPWTWFQ. 

GCSADELDRI 
GCSADELDRI 
GCSADELDRI 
GQNATYLGEL 
GGNAAYLGEL 
GHTSQEVGEA 
HKTADELNDL 
GVDANTLMSL 

TVRGEAPVDF 
TVRSEAPVEH 
TVRSEAPVDH 
TAVDVKPVDH 
TVMDVKPVDY 
TVQSVAPINH 
TVMSINQMDW 
TVANLHSLNY 



£C0QKNDSEC 
£G£]QKNDSEC 
iOTKNDAEC 

otnkefpnf 
otdtefpnf 
™nalfpnf 

£J£MEYDAP 
OTDTNNARP 

RKEEGDDSYV 
RKEEGDDSYV 
RKEEGDDSYL 
.KEKNEQAYD 
. KDKNEQAFE 
.LENDPAKAE 
. QENDENAFM 
. KEEDPNEFT 

AAESKYLLEE 
AAESKYLLEQ 
AAEAKYMLEE 
KEYSRRLIMN 
REYGRRLVMS 
KEYNKHLLKE 
KEESKRLINF 
RAEADYLADE 



550 

SLRYIMVIKL 
SLRYIMVIKV 
SLRYIMVIKV 
KYRLILSANI 
KYRMILSVNI 
KYRLLINMSI 
GYRYIITIAV 
NWRYILTISI 

600 

LKLKEATWVP 
LKLKEATWVP 
LKLKEATWVP 
EVFQNANFRS 
EVFQNANFRS 
QIFSALNFTS 
NCMAEACYMP 
KITQSIQMNE 

650 

IAKLTAC*-- 
IAKLTA***- 
IAKLTGC — 
IRKMATQGV- 
IRRSALM — 
LQELTGIGSS 

IESAQ 

LSKALLA — 



N 
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SEQUENCE LISTING 

<110> Mahajan, Pramod 3. 

<120> Maize Replication Protein A and Use 

<130> 5718-59 
<160> 22 

<170> FastSEQ for Windows Version 3.0 



<210> 1 
<21I> 2497 
<212> DNA 
<213> 2ea Mays 

<220> 
<221> CPS 

<222> (157) . . . {2025) 

<223> Coding sequence for the Maize RPA Large Subunit 
Homologue-1 

<221> nusc_f eature 
<222> (0) . . . (0) 

<223> Maize RPA Large subunit Homologue-1 
<400> 1 

ccttatcata ttataagcgc gcgtagcctt ggcagcrcga cgcatcttcg cctccgctca 
acgctcgccc acgcccccag cccccaccga tccacgagaa accttctcgc ctccgcggga 
cgattcgcca gggagagcaa aggtagcaga ggcgcc atg gac get gcc aag teg 

Met Asp Ala Ala Lys Ser 
1 5 



60 
120 
174 



gtg acg ccg ggc gcc grg tec tac ate ctg gcg cac ccg tct acg ggc 
Val Thr Pro Gly Ala Val Ser Tyr lie Leu Ala His Pro Ser Thr Gly 
10 15 20 



tec gat ggc gcc gtg teg gat etc gtc gtt cag gtc etc gat etc aag 
Ser Asp Gly Ala Val Ser Asp Leu Val Val Gin Val Leu Asp Leu Lys 
25 30 35 



270 



tec ate ggc atg ggc age egg ttc agt ttc acg gca tec gat ggg aac 
Ser He Gly Met Gly Ser Arg Phe Ser Phe Thr Ala Ser Asp Gly Asn 
40 45 50 



318 



gac aaa ate aag gcg atg etc ccc act tac ttt gcg teg gag grc cac 
Asp Lys He Lys Ala Met Leu Pro Thr Tyr Phe Ala Ser Glu Val His 
55 60 65 70 



366 



tec ggc aat ctg aag aat ttc ggt etc ate cgc ate etc gac tac act 
Ser Gly Asn Leu Lys Asn Phe Gly Leu He Arg lie Leu Asp Tyr Thr 
75 80 85 



414 



tgc aac tec gtc aaa ggc aac get gac aaa gtc ctg att gtc gtc aaa 



462 



1 
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Cys Asn Ser Val Lys Gly 
90 

tgc gag act gtg tgc gaa 

Cys Glu Thr Val Cys Glu 
105 

aag aaa gag gat cct cca 

Lys Lys Glu Asp Pro Pro 
120 

gtc gtg get gag gaa aca 

Val Val Ala Glu Glu Thr 

135 140 

caa gag gtg aag tec gcg 
Gin Glu Val Lys Ser Ala 
155 

get get cct gec acg cgc 
Ala Ala Pro Ala Thr Arg 
170 

ate act ctg aac ccc tac 
He Thr Leu Asn Pro Tyr 
185 

acg age aaa ggc aat ctg 
Thr Ser Lys Gly Asn Leu 
200 

tgc gtc ttc aac gta gag 
Cys Val Phe Asn Val Glu 
215 220 

gee ace atg ttt aac gag 
Ala Thr Met Phe Asn Glu 
235 

ctg gga aag gtc tat tat 
Leu Gly Lys Val Tyr Tyr 
250 

aag cag ttc aag aca gtc 
Lys Gin Phe Lys Thr Val 
265 

aat get att gtt gaa gaa 
Asn Ala He Val Glu Glu 
280 

caa tac aac ctt gtc aag 
Gin Tyr Asn Leu Val Lys 
295 300 

agg gag ctt gta gat att 
Arg Glu Leu Val Asp lie 



Asn Ala Asp Lys Val Leu 
95 

gcg etc gac gee gag ate 

Ala Leu Asp Ala Glu He 
110 

att gtg ctg aag cct aaa 

He Val Leu Lys Pro Lys 

125 " 130 

aat tct ccc cca etc gtg 
Asn Ser Pro Pro Leu Val 
145 

tec cag ate gtg act gag 
Ser Gin lie Val Thr Glu 
160 

ctt tec atg aca acg agg 
Leu Ser Met Thr Arg Arg 
175 

cag ggt aac tgg gtc att 
Gin Gly Asn Trp Val lie 
190 

aga ace tac agg aat get 
Arg Thr Tyr Arg Asn Ala 
205 210 

ctt act gat gag gat ggc 
Leu Thr Asp Glu Asp Gly 
225 

get cca aac aag ttc tat 
Ala Ala Lys Lys Phe Tyr 
240 

gtc tea aaa gga tct ctt 
Val Ser Lys Gly Ser Leu 
255 

aaa aat gac tat gag ttg 
Lys Asn Asd Tyr Glu Leu 
270 

gca gag ggg gag act ttc 
Ala Glu Gly Glu Thr Phe 
285 290 

att gat cag eta gga cca 

He Asp Gin Leu Gly Pro 
305 

gtt ggt gtg gtt cag age 

Val Gly Val Val Gin Ser 



lie Val Val Lys 
100 

aac ggc gag gee 510 

Asn Gly Glu Ala 

115 

gac gaa ggc tea 558 

Asp Glu Gly Ser 



atg aag cct aag 606 
Met Lys Pro Lys 
150 

cag cgt gga aat 654 
Gin Arg Gly Asn 
165 

gtc cat ccc ttg 702 
Val Hxs Pro Leu 
180 

aag gtg egg gtc 750 

Lys Val Arg Val 

195 

cgt gga gaa ggc 798 
Arg Gly Glu Gly 



ace cag ate cag 846 
Thr Gin He Gin 
230 

cca att ttt gag 894 
Pro lie Phe Glu 
245 

aga att gee aac 942 
Arg He Ala Asn 
260 

tea eta aac gag 990 

Ser Leu Asn Glu 

275 

ctt cca cca gtg 1038 
Leu Pro Pro Val 



tac gtc ggt ggc 1086 
Tyr Val Gly Gly 
310 

gta tct ccc aca 1134 
Val Ser Pro Thr 



2 
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etc agt gtt 
Leu Ser Val 



att gtt gta 
lie Val Val 
345 

aat gat ctt 
Asn Asp Leu 
360 

agt teg cct 
Ser Ser Pro 
375 

ggc gtg tct 
Gly Val Ser 



gac ctg cct 
Asp Leu Pro 



aaa gat act 
Lys Asp Thr 
425 

get ggt ggt 
Ala Gly Gly 
440 

ate acc agt 

He Thr Ser 
455 

ctg tac gec 
Leu Tyr Ala 



cgt get tgc 
Arg Ala Cys 



gga tac tgg 
Gly Tyr Trp 
5 05 

agg tac ate 
Arg Tyr He 
520 

gtg tee gtg 
Val Ser Val 
535 



315 

age aga aag 
Arg Arg Lys 
330 

gca gac gac 
Ala Asp Asp 



get act acg 
Ala Thr Thr 



gtt gtt gcg 
Val Val Ala 
380 

ctt tea act 
Leu Ser Thr 
395 

gag get aag 
Glu Ala Lys 
410 

tea ctg gca 
Ser Leu Ala 



ttc aag tee 
Phe Lys Ser 



gat cct get 

Asp Pro Ala 
460 

ate ata age 
He He Ser 
475 

acg ace tgt 
Thr Thr Cys 
490 

tgc gag ggg 
Cys Glu Gly 



atg gtg ate 
Met Val He 



ttc aac gag 
Phe Asn Glu 
540 



att gac aac 
He Asp Asn 
335 

tct ggc aaa 
Ser Gly Lys 
350 

act ggc caa 
Thr Gly Gin 
365 

ata aag age 
lie Lys Ser 



att ggc aga 
He Gly Arg 



aat ctt aag 
Asn Leu Lys 
415 

cca ate agt 
Pro He Ser 
430 

atg tat tcr 
Met Tyr Ser 
445 

atg ggc cag 

Met Gly Gin 



cac ate aag 
His He Lys 



aac aag aag 
Asn Lys Lys 
495 

tgc caa aag 
Cys Gin Lys 
510 

aag etc tec 
Lys Leu Ser 
525 

cat gcg gag 
His Ala Glu 



320 

gag aca ata 
Glu Thr He 



act gtt act 
Thr Val Thr 



gag ctt ttg 
Glu Leu Leu 
370 

eta aaa gta 
Leu Lys Val 
385 

agt act etc 
Ser Thr Leu 
400 

tec tgg tat 
Ser Trp Tyr 



gca gaa gcg 
Ala Glu Ala 



gat aga gtt 
Asp Arg Val 
450 

gaa aag cct 
Glu Lys Pro 
465 

cct gat cag 
Pro Asp Gin 
480 

gtg act gaa 
Val Thr Glu 



aat gac tct 
Asn Asp Ser 



gat ccc act 
Asp Pro Thr 
530 

aag ate ate 
Lys He He 
545 



325 

ccg aag cgt 
Pro Lys Arg 
34 0 

att tct etc 
lie Ser Leu 
355 

gac atg gtt 
Asp Met Val 



tct gac ttc 
Ser Asp Phe 



gag att aat 
Glu He Asn 
405 

gac tct gaa 
Asp Ser Glu 
420 

ggt gee aca 
Gly Ala Thr 
435 

ttt ctg tct 
Phe Leu Ser 



gtt ttc ttc 
Val Phe Phe 



aat atg tgg 
Asn Met Trp 
485 

get ttt ggg 
Ala Phe Gly 
500 

gag tgc teg 
Glu Cys Ser 
515 

ggt gag get 
Gly Glu Ala 



ggc tgc age 
Gly Cys Ser 



gac 1182 
Asp 



tgg 1230 
Trp 



gac 1278 
Asp 



caa 1326 

Gin 

390 

cct 1374 
Pro 



ggc 1422 
Gly 



cgc 1470 
Arg 



cac 1518 
His 



agt 1566 

Ser 

470 

tac 1614 
Tyr 



tct 1662 
Ser 



ctg 1710 
Leu 



tgg 1758 
Trp 



gee 1806 

Ala 

550 
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gac gag ctt gat egg ate agg aaa gag gag ggg gac gac age tac gtt 
Asp Glu Leu Asp Arg lie Arg Lys Glu Glu Gly Asp Asp Ser Tyr Val 
555 560 565 



1854 



etc aag ere aag gaa gee ace tgg gtt cct cac ctg ttc cgc gte age 
Leu Lys Leu Lys Glu Ala Thr Trp Val Pro His Leu Phe Arg Val Ser 
570 575 580 



1902 



gte aca cag cat gaa tac atg aac gag aag agg cag aga ate ace gtg 
Val Thr Gin His Glu Tyr Met Asn Glu Lys Arg Gin Arg lie Thr Val 
585 590 595 



1950 



agg ggt gaa gca ccg gte gac ttc gca get gag tec aag tac ttg ctt 
Arg Gly Glu Ala Pro Val Asp Phe Ala Ala Glu Ser Lys Tyr Leu Leu 
600 605 610 



1998 



gaa gag ate gcg aag etc ace get tgc tagaagaege agtctttctg 
Glu Glu lie Ala Lys Leu Thr Ala Cys 
615 620 



2045 



gtggttcttg 
gtaacttgat 
atgtagatgc 
tgcagttcca 
attgatgatg 
tccctatatt 
gttgtgcgtg 
aaaaaaaaaa 



aaggactggc 
tactgttctg 
tagtttacct 
atrccgtgta 
ttaggtcget 
ttattctatt 
tccgatgagt 
aaaaaaaaaa 



ccccgatatg 
tgtgttgctc 
tggtgteaag 
tctgcaacct 
gcagctaaca 
ttagtattta 
ctattattga 
aaaaaaaaaa 



tctcectctc 
tcactgggtt 
gaacagatgc 
tgagcaaata 
agtgtttggt 
aggttgcgtt 
agcacaaaat 
aa 



agtttttctt 
ttagcacttc 
tattataagc 
gggaaagatt 
ttttagtgac 
tggttgcgtc 
tgggaataaa 



ttgagctcca 
tgtaaggtat 
ettgeaaaat 
atgagtacta 
tactgtttag 
gaetagacat 
aaaaaaaaaa 



2105 
2165 
2225 
2285 
2345 
2405 
2465 
2497 



<210> 2 
<21i> 623 
<212> PRT 
<213> Zea Mays 



<400> 2 

Met Asp Ala Ala Lys Ser Val Thr Pro Gly Ala Val Ser Tyr He Leu 

15 10 15 

Ala His Pro Ser Thr Gly Ser Asp Gly Ala Val Ser Asp Leu Val Val 

20 25 30 

Gin Val Leu Asp Leu Lys Ser He Gly Met Gly Ser Arg Phe Ser Phe 

35 40 45 

Thr Ala Ser Asp Gly Asn Asp Lys He Lys Ala Met Leu Pro Thr Tyr 

50 55 60 

Phe Ala Ser Glu Val His Ser Gly Asn Leu Lys Asn Phe Gly Leu He 
65 70 75 80 

Arg He Leu Asp Tyr Thr Cys Asn Ser Val Lys Gly Asn Ala Asp Lys 

85 90 95 

Val Leu He Val Val Lys Cys Glu Thr Val Cys Glu Ala Leu Asp Ala 

100 105 110 

Glu He Asn Gly Glu Ala Lys Lys Glu Asp Pro Pro lie Val Leu Lys 

115 * 120 125 

Pro Lys Asp Glu Gly Ser Val Val Ala Glu Glu Thr Asn Ser Pro Pro 

130 135 140 

Leu Val Met Lys Pro Lys Gin Glu Val Lys Ser Ala Ser Gin He Val 
145 150 155 160 

Thr Glu Gin Arg Gly Asn Ala Ala Pro Ala Thr Arg Leu Ser Met Thr 
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165 










170 










175 




Arg 


Arg 


Val 


His 


Pro 


Leu 


He 


Thr 


Leu 


Asn 


Pro 


Tyr 


Gin 


Gly Asn 


Trp 








180 










185 










190 




Val 


He 


Lys 


Val 


Arg 


Val 


Thr 


Ser 


Lys 


Gly Asn 


Leu 


Arg 


Thr 


Tvr 


Arg 






195 










200 










205 




Asn 


Ala 


Arg 


Gly 


Glu 


Gly 


Cys 


Val 


Phe 


Asn 


Vai 


Glu 


Leu 


Thr 


Asp 


Glu 




210 










215 










220 








Asp 


Gly 


Thr 


Gin 


He 


Gin 


Aia 


Thr 


Met 


Phe 


Asn 


Glu 


Aia 


Aia 


Lys 


Lys 


225 










230 










235 








240 


Phe 


Tyr 


Pro 


He 


Phe 


Glu 


Leu 


Gly 


Lys 


Val 


Tyr 


Tyr 


Val 


Ser 


Lys 


Gly 










245 










250 










255 


Ser 


Leu 


Arg 


He 


Ala 


Asn 


Lys 


Gin 


Phe 


Lys 


Thr 


Vai 


Lys 


Asn 


Asp 


Tyr 








260 










265 










270 


Glu 


Leu 


Ser 


Leu 


Asn 


Glu 


Asn 


Ala 


lie 


Vai 


Giu 


Giu 


Aia 


Giu 


Giy 


Glu 






275 










280 










285 






Thr 


Phe 


Leu 


Pro 


Pro 


Vai 


Gin 


Tyr 


Asn 


Leu 


Val 


Lys 


He 


Asp 


Gin 


Leu 




290 










295 










300 








Gly 


Pro 


Tyr 


Val 


Gly 


Gly 


Arg 


Glu 


Leu 


Val 


Asp 


He 


Val 


Gly Val 


Val 


305 










310 










315 










320 


Gin 


Ser 


Val 


Ser 


Pro 


Thr 


Leu 


Ser 


Val 


Arg 


Arg 


Lys 


lie 


Asp 


Asn 


Giu 










325 










330 










335 




Thr 


He 


Pro 


Lys 


Arg 


Asp 


He 


Val 


Val 


Ala 


Asp 


Asp 


Ser 


Gly 


Lys 


Thr 








340 










345 










350 






Val 


Thr 


He 


Ser 


Leu 


Trp 


Asn 


Asp 


Leu 


Ala 


Thr 


Thr 


Thr 


Giy 


Gin 


Glu 






355 










360 










365 








Leu 


Leu 


Asp 


Met 


Val 


Asp 


Ser 


Ser 


Pro 


Val 


Val 


Ala 


lie 


Lys 


Ser 


Leu 




370 










375 










380 








Lys 


Val 


Ser 


Asp 


Phe 


Gin 


Gly 


Val 


Ser 


Leu 


Ser 


Thr 


lie 


Gly Arg 


Ser 


385 










390 










395 










400 


Thr 


Leu 


Glu 


He 


Asn 


Pro 


Asp 


Leu 


Pro 


Glu 


Aia 


Lys 


Asn 


Leu 


Lys 


Ser 










405 










410 










415 




Trp 


Tyr 


Asp 


Ser 


Glu 


Gly 


Lys 


Asp 


Thr 


Ser 


Leu 


Ala 


Pro 


He 


Ser 


Ala 








420 










425 










430 






Glu 


Ala 


Gly 


Aia 


Thr 


Arg 


Aia 


Gly 


Gly 


Phe 


Lys 


Ser 


Met 


Tyr 


Ser 


Asp 






435 










440 










445 








Arg 


Val 


Phe 


Leu 


Ser 


His 


He 


Thr 


Ser 


Asp 


Pro 


Ala 


Met 


Gly 


Gin 


Giu 




450 










455 










460 








Lys 


Pro 


Val 


Phe 


Phe 


Ser 


Leu 


Tyr 


Ala 


He 


He 


Ser 


His 


He 


Lys 


Pro 


465 










470 










475 








480 


Asp 


Gin 


Asn 


Met 


Trp 


Tyr 


Arg 


Ala 


Cys 


Thr 


Thr 


Cys 


Asn 


Lys 


Lys 


Vai 










485 










490 










495 




Thr 


Glu 


Ala 


Phe 


Gly 


Ser 


Gly 


Tyr 


Trp 


Cys 


Glu 


Gly 


Cys 


Gin 


Lys 


Asn 








500 










505 










510 






Asp 


Ser 


Glu 


Cys 


Ser 


Leu 


Arg 


Tyr 


He 


Met 


Vai 


He 


Lys 


Leu 


Ser 


Asp 






515 










520 










525 








Pro 


Thr 


Gly 


Glu 


Ala 


Trp 


Vai 


Ser 


Val 


Phe 


Asn 


Glu 


His 


Ala 


Glu 


Lys 




530 










535 










540 








lie 


He 


Gly 


Cys 


Ser 


Aia 


Asp 


Glu 


Leu 


Asp Arg 


He 


Arg 


Lys 


Glu 


Giu 


545 










550 










555 










560 


Gly Asp 


Asp 


Ser 


Tyr 


Val 


Leu 


Lys 


Leu 


Lys 


Giu 


.Aia 


Thr 


Trp 


Val 


Pro 










565 










570 










575 




His 


Leu 


Phe 


Arg 


Vai 


Ser 


Val 


Thr 


Gin 


His 


Glu 


Tyr 


Met 


Asn 


Glu 


Lys 








580 










585 










590 






Arg 


Gin 


Arg 


lie 


Thr 


Val 


Arg 


Gly 


Glu 


Ala 


Pro 


Val 


Asp 


Phe 


Aia 


Ala 






595 










600 










605 








Glu 


Ser 


Lys 


Tyr 


Leu 


Leu 


Glu 


Glu 


He 


Ala 


Lys 


Leu 


Thr 


Ala 


Cys 






610 










615 










620 
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<210> 3 

<211> 2202 

<212> DNA 

<213> Zea Mays 

<220> 
<221> CDS 

<222> (91) . . . (1941) 

<223> Coding Region for Maize RPA Large Subunit 
Homologue-2 

<221> misc_f eature 
<222> (0) ... (0) 

<223> Maize RPA Large Subunit Homologue-2 
<400> 3 

acgttccccc cacgccccaa cctatccacg cgaaaccttc tttcccccgg gagacgattc 60 
gtcagggaga ggaaagaggc aagaggggcc atg gac get gec aag ttg gtg acg 114 

Met Asp Ala Ala Lys Leu Val Thr 
1 5 

ccg gtc get: gtg tct cac att ctg gcg cac ccg teg gcg ggc tec gac 162 
Pro Val Ala Val Ser His lie Leu Ala His Pro Ser Ala Gly Ser Asp 
10 15 20 

ggc gca gtg acc gat etc gtc gtt cag gtc etc gac ctg aag tec gtc 210 
Gly Ala Val Thr Asp Leu Val Val Gin Val Leu Asp Leu Lys Ser Val 
25 30 35 40 

ggc acg ggc age egg ttc agt ttc aca gca act gac ggg aag gat aag 258 
Gly Thr Gly Ser Arg Phe Ser Phe Thr Ala Thr Asp Gly Lys Asp Lys 
45 50 55 

ate aag gcg atg ctt ccc acc aac ttc ggg teg gag gtc cgc tct ggc 306 
lie Lys Ala Met Leu Pro Thr Asn Phe Gly Ser Glu Val Arg Ser Gly 
60 65 70 

aac ctg aag aac etc ggc etc ate cgc ate ate gac tac act tgc aac 354 
Asn Leu Lys Asn Leu Gly Leu lie Arg lie lie Asp Tyr Thr Cys Asn 
75 80 85 

gtc gtc aaa ggc aaa gat gac aaa gtc ttg gtt gtc ate aaa tgc gag 402 
Val Val Lys Gly Lys Asp Asp Lys Val Leu Val Val lie Lys Cys Glu 
90 95 100 

ctt gtg tgc caa gcg ctt gac gee gag ate aac ggc gag gee aaa aaa 450 
Leu Val Cys Gin Ala Leu Asp Ala Glu lie Asn Gly Glu Ala Lys Lys 
105 110 115 120 

gag gag cct cca att gtg ctg aag ccr aag gac gaa tgc gtg ggc gtg 498 
Glu Glu Pro Pro lie Val Leu Lys Pro Lys Asp Glu Cys Val Gly Val 



125 130 135 



act tec cca etc get atg aag ccc aag cag gag gtg aag tct gcg tec 
Thr Ser Pro Leu Ala Met Lys Pro Lys Gin Glu Val Lys Ser Ala Ser 
140 145 150 



546 
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cag ate gtg 
Gin lie Val 
155 

tec atg aca 
Ser Met Thr 
170 

ggt aac tgg 
Gly Asn Trp 
185 

acc tac agg 
Thr Tyr Arg 



acc gat gag 
Thr Asp Glu 



gca aag aag 
Ala Lys Lys 
235 

tea aaa gga 
Ser Lys Gly 
250 

aat gac tac 
Asn Asp Tyr 
2 65 

gag ggg gag 
Glu Gly Glu 



gat caa eta 
Asp Gin Leu 



ggt gtg gtt 
Gly Val Val 
315 

gac aac gag 
Asp Asn Glu 
330 

ggc aaa act 

Gly Lys Thr 
345 

ggg caa gag 

Gly Gin Glu 



aat gag cag 

Asn Glu Gin 



aag agg gtc 
Lys Arg Val 



gtc atr aag 
Val lie Lys 
190 

aat get cgc 
Asn Ala Arg 
205 

gat ggc acc 
Asp Gly Thr 
220 

ttc tat ccg 
Phe Tyr Pro 



tct ctt aga 
Ser Leu Arg 



gag atg tea 
Glu Met Ser 
270 

act tgc att 
Thr Cys lie 
285 

gga tea tat 
Gly Ser Tyr 
300 

cag age gta 
Gin Ser Vai 



aca ata ccg 
Thr lie Pro 



gtt agt ate 
Val Ser He 
350 

ctt ttg gac 

Leu Leu Asp 
365 



cgt gga aat 
Arg Gly Asn 
160 

cat cct ttg 
His Pro Leu 
175 

gtg egg gtc 
Val Arg Val 



gga gaa ggc 
Gly Glu Gly 



cag ate caa 
Gin He Gin 
225 

att ttt gag 
lie Phe Glu 
240 

att get aac 
He Aia Asn 
255 

eta aac gag 
Leu Asn Glu 



ccg caa gtg 
Pro Gin Vai 



gtc ggt ggc 
Val Gly Gly 
305 

tct ccc aca 
Ser Pro Thr 
320 

aag cgt gac 
Lys Arg Asp 
335 

tct ctt tgg 
Ser Leu Trp 



atg get gac 

Met Ala Asp 



act get cct 
Thr Ala Pro 



ate act ctg 
lie Thr Leu 
180 

acg age aaa 
Thr Ser Lys 
195 

tgt gtc ttc 
Cys Val Phe 
210 

gec acc atg 
Aia Thr Met 



ctg gga aag 
Leu Gly Lys 



aag cag ttc 
Lys Gin Phe 
250 

aat get att 
Asn Ala lie 
275 

caa tac aac 
Gin Tyr Asn 
290 

agg gaa ctt 
Arg Glu Leu 



etc agt gtc 
Leu Ser Vai 



att gtt gtg 
He Val Vai 
340 

aat gat ctt 
Asn Asp Leu 
355 

agt teg cct 

Ser Ser Pro 
370 



gtc aag ccc 
Val Lys Pro 
165 

aac ccc tac 
Asn Pro Tyr 



ggc aac ctg 
Gly Asn Leu 



aat gta gag 
Asn Val Glu 
215 

ttt aat gac 
Phe Asn Asp 
230 

gtc tat tat 
Vai Tyr Tyr 
245 

aag act gtc 

Lys Thr Vai 



gtt gaa gaa 
Val Glu Glu 



ctt gtc aag 
Leu Val Lys 

295 

gta gat att 
Val Asp He 
310 

agg aga aag 
Arg Arg Lys 
325 

gcg gat gac 
Aia Asp Asp 



get act acg 
Ala Thr Thr 



gtt gtt gcg 

Val Vai Aia 
375 



ctt 594 
Leu 



cag 642 
Gin 



aga 690 
Arg 

200 

etc 738 
Leu 



get 786 
Ala 



gtc 834 
Val 



caa 882 

Gin 



gca 930 

Ala 

280 

att 978 

lie 



gtt 1026 
Vai 



att 1074 
He 



tct 1122 
Ser 



act 1170 

Thr 

360 

ata 1218 

lie 
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aag age eta 
Lys Ser Leu 



ggc aaa agt 
Gly Lys Ser 
395 

etc aag tea 
Leu Lys Ser 
410 

att ggt gca 
He Gly Ala 
425 

tat tct gat 
Tyr Ser Asp 



ggc cag gaa 
Gly Gin Glu 



ate aag cct 
He Lys Pro 
475 

aag aag gtg 
Lys Lys Val 
490 

caa aag aat 
Gin Lys Asn 
505 

gtc tec gat 
Val Ser Asp 



gca gag aag 
Ala Glu Lys 



aaa gag gag 

Lys Glu Glu 
555 

tgg gtt cct 

Trp Val Pro 
570 

aac gag aaa 

Asn Glu Lys 
585 

cac gca get 



aaa gtg tct 
Lys Val Ser 
380 

act ctt gcg 
Thr Leu Ala 



tgg tat gac 
Trp Tyr Asp 



gaa atg ggt 
Glu Met Gly 
430 

aga gtt ttt 
Arg Val Phe 
445 

aag cct gtt 
Lys Pro Val 
460 

gac cag aac 
Asp Gin Asn 



act gaa act 
Thr Glu Thr 



gac teg gaa 
Asp Ser Glu 
510 

cct act ggc 
Pro Thr Gly 
525 

ate att ggc 
He He Gly 
540 

ggg gac gac 
Gly Asp Asp 



cac ctg ttc 
His Leu Phe 



agg cag aga 
Arg Gin Arg 

590 

gaa tec aag 



gac ttt caa 
Asp Phe Gin 
385 

att aat cct 
He Asn Pro 
400 

tct gaa ggc 
Ser Glu Gly 
415 

gee gca egg 
Ala Ala Arg 



ctg tct cac 
Leu Ser His 



ttc ttc agt 
Phe Phe Ser 
465 

atg tgg tac 
Met Trp Tyr 
480 

ttt gga tct 
Phe Gly Ser 
495 

tgc tea ctg 
Cys Ser Leu 



gag gca tgg 
Glu Ala Trp 



tgc age gee 
Cys Ser Ala 
545 

agt tat gtt 
Ser Tyr Val 
560 

cgc gtc age 
Arg Val Ser 
575 

ate act gtg 
He Thr Val 



tac ctg ctt 



ggc gtg tct 
Gly Val Ser 



gat eta cac 
Asp Leu His 



^aa gat act 
Lys Asp Thr 
420 

gee ggt ggc 
Ala Gly Gly 
4 35 

att act agt 
lie Thr Ser 
450 

ttg tat gec 
Leu Tyr Ala 



cgt get tgc 
Arg Ala Cys 



gga tac tgg 
Gly Tyr Trp 
500 

aga tac ate 
Arg Tyr He 
515 

ttc tct gtg 
Phe Ser Val 
530 

gac gag ctt 
Asp Glu Leu 



ctg aag etc 
Leu Lys Leu 



gtc aca cag 
Val Thr Gin 
580 

agg agt gaa 
Arg Ser Glu 

595 

gaa cag ata 



ctt tct act 
Leu Ser Thr 
390 

gag get cag 
Glu Ala Gin 
405 

teg ctg gca 
Ser Leu Ala 



ttc aag tec 
Phe Lys Ser 



gat cct gee 
Asp Pro Ala 
455 

acc ata age 
Thr He Ser 
470 

aag acc tgc 
Lys Thr Cys 
485 

tgc gag gga 
Cys Glu Gly 



atg gtc ate 
Met Val lie 



ttc aac gag 
Phe Asn Glu 
535 

gat egg ate 
Asp Arg He 
550 

aag gaa gee 
Lys Glu Ala 
565 

cat gaa tac 
His Glu Tyr 



gcg ccg gtc 
Ala Pro Val 



gcg aag ctt 



gta 1266 
Val 



aat 1314 
Asn 



cca 1362 
Pro 



acg 1410 

Thr 

440 

atg 1458 
Met 



cac 1506 
His 



aac 1554 
Asn 



tgc 1602 
Cys 



aag 1650 

Lys 

520 

cat 1698 
His 



agg 1746 
Arg 



acc 1794 
Thr 



aat 1842 
Asn 



gag 1890 
Glu 

600 

act 1938 
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His Ala Ala Glu Ser Lys Tyr Leu Leu Glu Gin lie Ala Lys Leu Thr 
605 610 615 

get tgatagtaga agatgcaacc ttactgcaaa tagcgaggat tattaggact 1991 
Ala 



aattgatggt gtcaggtcat tgcggcccta agctttagct ctctatcagc agtcagatgt 2051 

atcaaccatt ccctgctcta acagccatct at cagcagtc agatgtattt aaccaaaaaa 2111 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaagggcgg ccgctctaga 2171 

ggatccaagc ttacgtacgc gtgcatgcga c 2202 

<210> 4 
<211> 617 
<212> PRT 
<213> Zea Mays 

<400> 4 

Met Asp Ala Ala Lys Leu Val Thr Pro Val Ala Val Ser His lie Leu 

15 10 15 

Ala His Pro Ser Ala Gly Ser Asp Gly Ala Val Thr Asp Leu Val Val 

20 25 30 

Gin Val Leu Asp Leu Lys Ser Val Gly Thr Gly Ser Arg Phe Ser Phe 

35 40 45 

Thr Ala Thr Asp Gly Lys Asp Lys lie Lys Ala Met Leu Pro Thr Asn 

50 55 60 

Phe Gly Ser Glu Val Arg Ser Gly Asn Leu Lys Asn Leu Gly Leu lie 
65 ^70 75 80 

Arg lie lie Asp Tyr Thr Cys Asn Val Val Lys Gly Lys Asp Asp Lys 

85 90 95 

Val Leu Val Val lie Lys Cys Glu Leu Val Cys Gin Ala Leu Asp Ala 

100 105 110 

Glu lie Asn Gly Glu Ala Lys Lys Glu Glu Pro Pro lie Val Leu Lys 

115 120 125 

Pro Lys Asp Glu Cys Val Gly Val Thr Ser Pro Leu Ala Met Lys Pro 

130 135 140 

Lys Gin Glu Val Lys Ser Ala Ser Gin lie Val Asn Glu Gin Arg Gly 
145 150 155 160 

Asn Thr Ala Pro Val Lys Pro Leu Ser Met Thr Lys Arg Val His Pro 

165 170 175 

Leu lie Thr Leu Asn Pro Tyr Gin Gly Asn Trp Val lie Lys Val Arg 

180 185 190 

Val Thr Ser Lys Gly Asn Leu Arg Thr Tyr Arg Asn Ala Arg Gly Glu 

195 200 205 

Gly Cys Val Phe Asn Vai Glu Leu Thr Asp Glu Asp Gly Thr Gin lie 

210 215 220 

Gin Ala Thr Met Phe Asn Asp Ala Ala Lys Lys Phe Tyr Pro lie Phe 
225 230 235 240 

Glu Leu Gly Lys Val Tyr Tyr Val Ser Lys Gly Ser Leu Arg lie Ala 

245 250 255 

Asn Lys Gin Phe Lys Thr Val Gin Asn Asp Tyr Glu Met Ser Leu Asn 

260 265 270 

Glu Asn Aia lie Val Glu Glu Ala Glu Gly Glu Thr Cys lie Pro Gin 

275 280 285 

Val Gin Tyr Asn Leu Val Lys lie Aso Gin Leu Gly Ser Tyr Vai Gly 

290 295 300 

Gly Arg Glu Leu Val Asp lie Val Gly Val Val Gin Ser Val Ser Pro 
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305 










310 










315 










320 


Thr 


Leu 


Ser 


Val 


Arg 
325 


Arg 


Lys 


He 


Asp 


asn 
330 


Glu 


Thr 


He 


Pro 


Lys 
335 


Arg 


Asp 


He 


Val 


Val 

340 


Ala 


Asp 


Asp 


Ser 


Gly 
345 


Lys 


Thr 


Val 


Ser 


He 
350 


Ser 


Leu 


Trp Asn 


Asp 


Leu 


Ala 


Thr 


Thr 


Thr 


Gly 


Gin 


Glu 


Leu 


Leu 


Asp 


Met 


Ala 






355 










360 










365 






Asp 


Ser 
370 


Ser 


Pro 


Val 


Val 


Ala 
375 


He 


Lys 


Ser 


Leu 


Lys 
380 


Val 


Ser 


Asp 


Phe 


Gin 


Gly 


Val 


Ser 


Leu 


Ser 


Thr 


Val 


Gly 


Lys 


Ser 


Thr 


Leu 


Ala 


He 


Asn 


385 










390 










395 










400 


Pro 


Asp 


Leu 


His 


Glu 
405 


Ala 


Gin 


Asn 


Leu 


Lys 
410 


Ser 


Trp 


Tyr 


Asp 


Ser 
415 


Glu 


Gly Lys 


Asp 


Thr 


Ser 


Leu 


Ala 


Pro 


lie 


Gly Ala 


Glu 


Met 


Gly 


Ala 


Ala 








420 










425 










430 






Arg Ala 


Gly Gly 


Phe 


Lys 


Ser 


Thr Tyr 


Ser Asp Arg Val 


Phe 


Leu 


Ser 






435 










440 










445 








His 


He 
450 


Thr 


Ser 


Asp 


Pro 


Ala 
455 


Met 


Gly 


Gin 


Glu 


Lys 
460 


Pro 


Val 


Phe 


Phe 


Ser 


Leu 


Tyr 


Ala 


Thr 


He 


Ser 


His 


He 


Lys 


Pro 


Asp 


Gin 


Asn 


Met 


TrD 


465 










470 










475 










480 


Tyr Arg 


Ala 


Cys 


Lys 


Thr 


Cys 


Asn 


Lys 


Lys 


Val 


Thr 


Glu 


Thr 


Phe 


Gly 










485 










490 










495 




Ser 


Gly 


Tyr 


Trp 
500 


Cys 


Glu 


Gly 


Cys 


Gin 
505 


Lys 


Asn 


Asp 


Ser 


Glu 
510 


Cvs 

J; 


Ser 


Leu Arg 


Tyr 


lie 


Met 


Vai 


He 


Lys 


Val 


Ser 


Asp 


Pro 


Thr 


Gly 


Glu 


Ala 






515 










520 










525 








Trp 


Phe 
530 


Ser 


Val 


Phe 


Asn 


Glu 
535 


His 


Ala 


Glu 


Lys 


He 
540 


He 


Gly 


Cys 


Ser 


Ala 


Asp 


Glu 


Leu Asp 


Arg 


He 


Arg 


Lys 


Glu 


Glu 


Gly Asp Asp 


Ser 


Tyr 


545 










550 










555 










560 


Val 


Leu 


Lys 


Leu 


Lys 
565 


Glu 


Ala 


Thr 


Trp 


Val 
570 


Pro 


His 


Leu 


Phe 


Arg 
575 


Val 


Ser 


Val 


Thr 


Gin 


His 


Glu 


Tyr Asn Asn 


Glu 


Lys 


Arg 


Gin 


Arg 


lie 


Thr 








580 










585 










590 






Val 


Arg 


Ser 
595 


Glu 


Ala 


Pro 


Val 


Glu 
600 


His 


Ala 


Ala 


Glu 


Ser 
605 


Lys 


Tyr 


Leu 


Leu 


Glu 
610 


Gin 


He 


Ala 


Lys 


Leu 
615 


Thr 


Ala 

















<210> 5 
<211> 630 
<212> PRT 

<213> Oryza sativa 



<400> 5 

Met Asp Ser Asp Ala Ala Pro Ser 

1 ' 5 
Val Leu Glu Asn Ala Ser Pro Asp 
20 

Glu He Val Leu Gin Val Val Asp 

35 40 
Thr Phe Leu Ala Ser Asp Gly Lys 

50- 5S 
Thr Gin Leu Ala Pro Glu Val Arg 
65 70 
Val He Arg Val Leu Asp Tyr Thr 



Val Thr Pro Gly Ala Vai Ala Phe 

10 ' 15 

Ala Ala Thr Gly Val Pro Val Pro 
25 30 
Leu Lys Pro lie Gly Thr Arg Phe 
45 

Asp Lys lie Lys Thr Met Leu Leu 
60 

Ser Gly Asn He Gin Asn Leu Gly 

75 80 
Cys Asn Thr lie Gly Glu Lys Gin 
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85 



Glu 


Lys 


Val 


Leu 


He 


He 


Thr 


Lys 








100 










Asp 


Ser 


Glu 


He 


Lys 


Cys 


Glu 


Ala 






115 










120 


He 


Leu 


Leu 


Ser 


Pro 


Lys 


Glu 


Glu 




130 










135 




Asn 


Ala 


Pro 


Pro 


Leu 


Pro 


Pro 


Val 


145 










150 






Lys 


Ser 


Ala 


Ser 


Gin 


He 


Val 


Asn 










165 








Ala 


Ala 


Arg 


Leu 


Ala 


Met 


Thr 


Arg 








130 










Asn 


Pro 


Tyr 


Gin 


Gly 


Asn 


Trp 


He 






195 










200 


Gly 


Asn 


Leu 


Arg 


Thr 


Tyr 


Lys 


Asn 




210 










215 




Asn 


Val 


Glu 


Leu 


Thr 


Asp 


Val 


Asp 


225 










230 




Phe 


Asn 


Glu 


Ala 


Ala 


Lys 


Lys 


Phe 










245 








Val 


Tyr 


Tyr 


He 


Ser 


Lys 


Gly 


Ser 








260 










Lys 


Thr 


Val 


His 


Asn 


Asp 


Tyr 


Glu 






275 










280 


Val 


Glu 


Glu 


Ala 


Glu 


Gly 


Glu 


Thr 




290 










295 




Phe 


Val 


Lys 


He 


Asp 


Gin 


Leu 


Gly 


305 










310 




Val 


Asp 


Val 


He 


Gly 


Val 


Val 


Gin 










325 








Arg 


Arg 


Lys 


lie 


Asp 


Asn 


Glu 


Thr 








340 










Ala 


Asp 


Asp 


Ser 


Ser 


Lys 


Thr 


Val 






355 










360 


Ala 


Thr 


Thr 


Thr 


Gly 


Gin 


Glu 


Leu 




370 










375 




He 


He 


Ala 


lie 


Lys 


Ser 


Leu 


Lys 


385 










390 




Leu 


Ser 


Thr 


Val 


Gly Arg 


Ser 


Thr 










405 








Glu 


Ala 


Glu 


Gin 


Leu Arg Ala Trp 








420 










Ser 


Met 


Ala 


Ser 


He 


Gly 


Ser 


Asp 






435 










4 40 


Ala 


Arg 


Ser 


Met 


Tyr 


Ser 


Asp Arg 




450 










455 




Asp 


Pro 


Asn 


Leu 


Gly 


Gin 


Asp 


Lys 


4 6*5 










470 






Tyr 


lie 


Ser 


Leu 


He 


Lys 


Pro 


Asp 










485 








Lys 


Thr 


Cys 


Asn 


Lys 


Lys 


Val 


Thr 








500 










Cys 


Glu 


Gly 


Cys 


Gin 


Lys 


Asn Asp 






515 










520 


Met 


Vai 


He 


Lys 


Val 


Ser 


Asp 


Pro 




530 










535 







90 










95 




Leu 


Glu 


Val 


Val 


Phe 


Lys 


Ala 


Leu 


105 










110 






Glu 


Lys 


Gin 


Glu 


Glu 


Lys 


Pro 


Ala 










125 








Ser 


Vai 


Val 


Leu 


Ser 


Lys 


Pro 


Thr 








140 










Val 


Leu 


Lys 


Pro 


Lys 


Gin 


Glu 


Val 






155 










160 


Glu 


Gin 


Arq 


Gly Asn Ala 


Ala 


Pro 




170 










175 




Arg Val 


His 


Pro 


Leu 


He 


Ser 


Leu 


185 










190 






lie 


Lys 


Val 


Arg 


Vai 


Thr 


Ser 


Lvs 










205 








Ala 


Arg 


Gly 


Glu 


Gly Cys 


Vai 


Phe 








220 










Gly Thr 


Gin 


He 


Gin 


Ala 


Thr 


Met 






235 










240 


Tyr 


Pro 


Met 


Phe 


Glu 


Leu 


Gly 


Lys 




250 










255 


Leu Arg Val 


Ala 


Asn 


Lys 


Gin 


Phe 


265 










270 






Met 


Thr 


Leu 


Asn 


Glu 


Asn 


Ala 


Val 










285 








Phe 


He 


Pro 


Gin 


He 


Gin 


Tvr 


Asn 








300 










Pro 


Tyr 


Val 


Glv 


Gly Arg 


Glu 


Leu 






315 










320 


Ser 


Val 


Ser 


Pro 


Thr 


Leu 


Ser 


Val 




330 










335 




He 


Pro 


Lys 


Arg 


Asp 


He 


Vai 


Val 


345 










350 






Thr 


He 


Ser 


Leu 


Trp 


Asn 


Asp 


Leu 










3 65 








Leu 


Asp 


Met 


Val 


As p 


Ser 


Ala 


Pre 








380 










Val 


Ser 


Asp 


Phe 


Gin 


Gly 


Leu 


Ser 






395 










400 


He 


Vai 


Vai 


Asn 


Pro 


Asp 


Leu 


Pro 




410 










415 




Tyr 


Asp 


Ser 


Glu 


Gly 


Lys 


Gly 


Thr 


425 










430 






Met 


Gly Ala 


Ser 


Arg 


Val 


Gly 


Gly 










445 








Val 


Phe 


Leu 


Ser 


His 


lie 


Thr 


Ser 








460 










Pro 


Val 


Phe 


Phe 


Ser 


Leu 


Asn 


Ala 






475 










480 


Gin 


Thr 


Met 


Trp 


Tyr 


Arg 


Ala 


Cys 




490 










495 




Glu 


Ala 


Met 


Gly 


Ser 


Gly 


Tyr 


Trp 


505 










510 






Ala 


Glu 


Cys 


Ser 


Leu 


Arg 


Tyr 


He 










525 








Thr 


Gly Glu 


Ala 


Trp 


Leu 


Ser 


Leu 



540 
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Phe 


Asn 


Asp 


Gin 


Ala 


Glu 


Arg 


He 


Val Gly Cys 


Ser 


Ala 


Asd 


Glu 


Leu 


545 










550 








555 










560 


Asp Arg 


He 


Arg 


Lys 


Glu 


Glu 


Gly Asp Asp 


Ser Tyr 


Leu 


Leu 


Lys 


Leu 










565 








570 










575 




Lys 


Glu 


Ala 


Thr 


Trp 


Val 


Pro 


His 


Leu Phe 


Arg 


Val 


Ser 


Val 


Thr 


Gin 








580 










585 








590 






Asn 


Glu 


Tyr 


Met 


Asn 


Glu 


Lys 


Arg 


Gin Arg 


He 


Thr 


Val 


Arg 


Ser 


Glu 






595 










600 








605 








Ala 


Pro 


Val 


Asp 


His 


Ala 


Ala 


Glu 


Ala Lys 


Tyr 


Met 


Leu 


Glu 


Glu 


He 




610 










615 








620 










Ala 


Lys 


Leu 


Thr 


Gly 


Cys 




















625 










630 





















<210> 6 
<211> 609 
<212> PRT 

<213> Xenopus laevis 
<400> 6 



Met Ala 


Le u 


Pro 


Gin 


Leu 


Ser 


Glu 


Gly Ala 


lie 


Ser 


Ala 


i It: L. 


Leu 


Gly 


1 






5 








10 










15 




Gl v Asn 


Ser 


Ser 


Cys 


Lys 


Pro 


Thr 


Leu Gin 


Val 


He 


Asn 


lie 


Arg 


Pro 






20 










25 








in 






He As n 


Thr 


Gly 


Asn 




Pro 


Pro 


Arg Tyr 


Arg 


Leu 


Leu 


l It: L. 


Ser 


Asp 




35 










40 








45 








Glv L^u 

w _l y j_>*_ u 


As n 


Thr 


Leu 


Ser 


Ser 


Phe 


Met Leu 


Ala 


Thr 


Gin 


Lieu 


Asn 


Ser 


50 










55 








60 










Leu Val 


Asp 


Asn 


Asn 


Leu 


Leu 


Ala 


Thr Asn 


Cys 


He 


Cys 


Gin 


Val 


Ser 


65 








70 








75 










80 


Arg Phe 


lie 


Val 


Asn 


Asn 


Leu 


Lys Asp Gly Arg Arg Val 


He 


lie 


Val 








85 








90 










95 




Met Glu 


Leu 


Asp 


Val 


Leu 


Lys 


Ser 


Ala Asp 


Leu 


Val 


Met 


Gly 


Lys 


lie 






100 










105 








110 






Gly Asn 


Pro 


Gin 


Pro 


Tyr 


Asn 


Asp 


Gly Gin 


Pro 


Gin 


Pro 


Ala 


Ala 


Pro 




115 










120 








125 








Ala Pro 


Ala 


Ser 


Ala 


Pro 


Ala 


Pro 


Ala Pro 


Ser 


Lys 


Leu 


Gin 


Asn 


Asn 


130 










135 








140 










Ser Ala 


Pro 


Pro 


Pro 


Ser 


Met 


Asn 


Arg Gly Thr 


Ser 


Lys 


Leu 


Phe 


Gly 


145 








150 








155 










160 


Gly Gly 


Ser 


Leu 


Leu 


Asn 


Thr 


Pro 


Gly Gly 


Ser 


Gin 


Ser 


Lys 


Val 


Val 








165 








170 










175 




Pro He 


Ala 


Ser 


Leu 


Asn 


Pro 


Tyr 


Gin Ser 


Lys 


Trp 


Thr 


Val 


Arg Ala 






180 










185 








190 






Arg Val 


Thr 


Asn 


Lys 


Gly 


Gin 


He Arg Thr Trp 


Ser 


Asn 


Ser 


Arg 


Gly 




195 










200 








205 








Glu Gly 


Lys 


Leu 


Phe 


Ser 


He 


Glu 


Met Val 


Asp 


Glu 


Ser 


Gly 


Glu 


lie 


210 










215 








220 










Arg Ala 


Thr 


Ala 


Phe 


Asn 


Glu 


Gin Ala Asp 


Lys 


Phe 


Phe 


Ser 


lie 


lie 


225 








230 








235 










240 


Glu Val 


Asn 


Lys Val 


Tyr 


Tyr 


Phe 


Ser Lys 


Gly Thr 


Leu 


Lys 


lie 


Ala 








245 








250 










255 




Asn Lys 


Gin 


Tyr 


Thr 


Ser 


Val 


Lys 


Asn Asp 


Tyr 


Glu 


Met 


Thr 


Phe 


Asn 






260 










265 








270 






Ser Glu 


Thr 


Ser 


Val 


He 


Pro 


Cys 


Asp Asp 


Ser 


Ala 


Asp 


Val 


Pro 


Met 




275 










280 








235 








Val Gin 


Phe 


Glu 


Phe 


Val 


Ser 


He Gly Glu 


Leu 


Glu 


Ser 


Lys 


Asn 


Lys 



290 295 300 
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As p 


Th r 

1 III 


Vdl 


LcU 


Asp 


lie 


He 


Gly Val 


i n s 
j *j j 










jlL) 








TH r 


7 \7 ^ 

i-»y 


vdl 


i ii r 


i j- e 


Lys 


be r 


Asn 


Asn 










"5 o ^ 










lie 


His 


-Ue u 


1 IC — 


As o 


JCi 


Ser 


Gly 


Lys 








34 0 










345 


Gly 


Glu 


AS p 




Asp 


Lys 


Phe 


Asp 


Gly 






1 s s 

J J J 










360 


a — e 


T \T<i 




M.1 a 


Arg 


Leu 


Ser 


Asp 


Phe 




^"7 n 

J r U 










375 






Le u 




O tr L 


Ser 


i nr 


Vdl 


Met 


He 


Asn 


385 










J jU 








Lvs 


Leu 


Arg 


Ala 


i -p 


tr 1 Itr 


Asp 


Ser 


Glu 




















Ser 


He 




GI u 


■J CT £ 


Arg 


Gly Gly Gly 








4 7 0 










425 




Lys 


^e* r 
■J cr i. 


Leu 




ulU 


Val 


Lys 


Asn 






4 1 S 










440 




.Lys 


A I a 

■rVJ. a 


As p 


Tyr 


rne 


Tnr 


Ser 


Val 


Ala 




4 SO 
n J u 










455 






n l u 

Ul u 


As n 


Cy s 


Leu 


Tyr 


bin 


Ala 


Cys 


Pro 


4 ^ 










A "7 f) 

4 / U 






Va 1 
v a. j_ 


lie 


As p 


o±n 


uin 


As n 


Gly 


Leu 


Phe 










A Q Q 
4 O 0 










clu 


rne 


Pro 


As n 


Dh 

rne 


Lys 


Tyr Arg 


Leu 


















505 


Asp 




oiy 




As n 




Trp 


He 


Thr 
















520 




Ser 


He 


Leu 


Gly 


Gin 


Asn 


Ala 


Thr 


Tyr 














535 




Asn 


Glu 


Gin 


Ala 


Tyr 


Asp 


Glu 


Val 


Phe 


5 4 ^ 










n 








Tyr 


Thr 


Phe 


Arg 


Ala 


Arg 


Val 


Lys 


Leu 










c zr c 










Arg 


He 


Lys 


Ala 


Thr 


Ala 


Val 


Asp 


Val 








580 










585 


Tyr 


Ser 


Arg 


Arg 


Leu 


lie 


Met 


Asn 


He 






595 










600 
























<210> 


7 














<211> 


616 














<212> 


PRT 














<213> 


Homo sapiens 








<400> 


7 












Met 


Val Gly 


Gin 


Leu 


Ser 


Glu 


Gly Ala 


1 








5 










Gly Asp Thr 


Asn 


He 


Lys 


Pro 


lie 


Leu 








20 










25 


He 


Thr 


Thr 


Gly 


Asn 


Ser 


Pro 


Pro 


Arg 






35 










40 


Gly 


Leu 


Asn 


Thr 


Leu 


Ser 


Ser 


Phe 


Met 




50 










55 






Leu 


Val 


Glu 


Glu 


Glu 


Gin 


Leu 


Ser 


Ser 
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Cys 


Lys 


Asn 


Val 


Glu 


Glu 


Val 




315 










320 


Arg 


Giu 


Val 


Ser 


Lys 


Arg 


Ser 


330 










335 




Val 


Val 


Ser 


Thr 


Thr 


Leu 


Trp 










350 






Ser 


Arg 


Gin 


Pro 


Val 


Val 


Ala 








365 








Gly Gly 


Arg 


Ser 


Leu 


Ser 


Val 






380 










Pro 


Asp 


lie 


Pro 


Giu 


Ala 


Phe 




395 










400 


Gly Gin 


Val 


Val 


Giu 


Gly 


Thr 


410 










415 




Thr 


Gly 


Gly 


Gly 


Asn 


Thr 


Asn 










430 






Glu 


Asn 


Leu 


Gly 


His 


Gly 


Glu 








445 








Thr 


He 


Val 


Tyr 


Leu 


Arg 


Lys 






460 










Ser 


Gin 


Asp 


Cys 


Asn 


Lys 


Lys 




475 










480 


Arg 


Cys 


Glu 


Lys 


Cys 


Asn 


Lys 


490 










495 


He 


Leu 


Ser 


Ala 


Asn 


lie 


Ala 










510 






Cys 


Phe 


Gin 


Glu 


Ser 


Ala 


Glu 








525 








Leu 


Gly 


Glu 


Leu 


Lys 


Glu 


Lys 






540 










Gin 


Asn 


Ala 


Asn 


Phe 


Arg 


Ser 




555 








560 


GlU 


Thr 


Tyr Asn 


Asp 


Glu 


Ser 


570 










575 




Lys 


Pro 


Val 


Asp 


His 


Lys 


Glu 










590 






Arg 


Lys 


Met 


Ala 


Thr 


Gin 


Gly 








60S 








He 


Ala 


Ala 


lie 


Met 


Gin 


Lys 


10 










15 




Gin 


Val 


lie 


Asn 


He 


Arg 


Pro 










30 






Tyr 


Arg 


Leu 


Leu 


Met 


Ser 


Asp 








45 








Leu 


Ala 


Thr 


Gin 


Leu 


Asn 


Pro 






60 










Asn 


Cys 


Val 


Cys 


Gin 


He 


His 



75 80 
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Arg Phe lie Val Asn Thr Leu Lys Asp Gly Arg Arg Val Val He Leu 

8 5 90 95 

Met Glu Leu Glu Val Leu Lys Ser Ala Glu Ala Val Gly Val Lys lie 

100 105 ' 110 

Gly Asn Pro Val Pro Tyr Asn Glu Gly Leu Gly Gin Pro Gin Val Ala 

115 120 ^ 125 

Pro Pro Ala Pro Ala Ala Ser Pro Ala Ala Ser Ser Arg Pro Gin Pro 

130 135 140 

Gin Asn Gly Ser Ser Gly Met Gly Ser Thr Val Ser Lvs Ala Tyr Gly 
145 150 155 160 

Ala Ser Lys Thr Phe Gly Lys Ala Ala Gly Pro Ser Leu Ser His Thr 

165 170 175 

Ser Gly Gly Thr Gin Ser Lys Val Val Pro He Ala Ser Leu Thr Pro 

180 185 190 

Tyr Gin Ser Lys Trp Thr He Cys Ala Arg Val Thr Asn Lys Ser Gin 

195 200 205 

He Arg Thr Trp Ser Asn Ser Arg Gly Glu Gly Lys Leu Phe Ser Leu 

210 215 " 220 

Glu Leu Val Asp Glu Ser Gly Glu He Arg Ala Thr Ala Phe Asn Glu 
225 230 235 240 

Gin Val Asp Lys Phe Phe Pro Leu He Glu Val Asn Lys Val Tyr Tyr 

245 • 250 ' 255 

Phe Ser Lys Gly Thr Leu Lys He Ala Asn Lys Gin Phe Thr Ala Val 

260 265 270 

Lys Asn Asp Tyr Glu Met Thr Phe Asn Asn Glu Thr Ser Val Met Pro 

275 280 285 

Cys Glu Asp Asp His His Leu Pro Thr Val Gin Phe Asp Phe Thr Gly 

290 295 300 

He Asp Asp Leu Glu Asn Lys Ser Lys Asp Ser Leu Val Asp He He 
305 310 315 320 

Gly lie Cys Lys Ser Tyr Glu Asp Ala Thr Lys He Thr Val Arg Ser 

325 330 335 

Asn Asn Arg Glu Val Ala Lys Arg Asn He Tyr Leu Met Asp Thr Ser 

340 345 350 

Gly Lys Val Val Thr Ala Thr Leu Trp Gly Glu Asp Ala Asp Lys Phe 

355 360 1 365 

Asp Gly Ser Arg Gin Pro Val Leu Ala He Lys Gly Ala Arg Val Ser 

370 375 380 

Asp Phe Gly Gly Arg Ser Leu Ser Val Leu Ser Ser Ser Thr He He 
385 390 395 400 

Ala Asn Pro Asp lie Pro Glu Ala Tyr Lys Leu Arg Gly Trp Phe Asp 

405 410 415 

Ala Glu Gly Gin Ala Leu Asp Gly Val Ser He Ser Asp Leu Lys Ser 

420 425 430 

Gly Gly Val Gly Gly Ser Asn Thr Asn Trp Lys Thr Leu Tyr Glu Val 

435 440 445 

Lys Ser Glu Asn Leu Gly Gin Gly Asp Lys Pro Asp Tyr Phe Ser Ser 

450 455 460 

Val Ala Thr Val Val Tyr Leu Arg Lys Glu Asn Cys Met Tyr Gin Ala 
465 470 475 * " 480 

Cys Pro Thr Gin Asp Cys Asn Lys Lys Val He Asp Gin Gin Asn Gly 

485 490 * 495 

Leu Tyr Arg Cys Glu Lys Cys Asp Thr Glu Phe Pro Asn Phe Lys Tyr 

500 505 510 

Arg Met He Leu Ser Val Asn lie Ala Asp Phe Gin Glu Asn Gin Trp 

515 520 525 

Val Thr Cys Phe Gin Glu Ser Ala Glu Ala lie Leu Gly Gin Asn Ala 
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530 










535 






Ala 


Tyr 


Leu 


Gly Glu 


Leu 


Lys 


Asp 


Lys 


545 










550 






Val 


Phe 


Gin 


Asn 


Ala 


Asn 


Phe 


Arg 


Ser 










565 








Lys 


Val 


Glu 


Thr 
580 


Tyr 


Asn 


Asp 


Glu 


Ser 
585 


Asp 


Val 


Lys 
595 


Pro 


Val 


Asp 


Tyr 


Arg 
600 


Glu 


Ser 


He 
610 


Arg 


Arg 


Ser 


Ala 


Leu 
615 


Met 





<210> 8 
<211> 603 
<212> PRT 

<213> Drosophila melanogaster 
<400> 8 



Met 


Val 


Le u 


Ala 


Ser 


Leu 


Ser 


Th r 


^ i. y 


1 








5 








Glv 


Glu 


Val 


Val 


Asp 


Ala 


Pro 


Val 


Leu 








20 












He 


As n 


Ser 


Ala 


Ala 


As p 


Ser 


GT n 


A^g 






35 














Glv 


Lys 


x y t 


Phe 


As n 


Ser 


1 J - 


Ala 


Mat 




50 










55 






Met 


Gin 


His 


As n 


Gl V 


Glu 


Leu 


ui U 




65 










7 0 








Lys 


Tyr 


Val 


Thr 


Ser 


Leu 


Val 


Glv 

wo. y 


L vs 










85 










Leu 


He 


He 


Ser 


Glu 


Leu 


Thr 


Val 


Val 








100 










105 


Ser 


Lys 


He 


Gly 


Glu 


Pro 


Val 


Thr 


Tyr 






115 










120 




Leu 


Ala 


Pro 


Lys 


Pro 


Ala 


Val 


Thr 


Ser 




130 










135 






Lys 


Glu 


Pro 


Ser 


His 


Asn 


Asn 


Asn 


Asn 


14 5 










150 








He 


Asn 


Ser 


Gly 


Met 


Thr 


His 


Pro 


lie 










165 










Asn 


Lys 


Trp 


Val 


He 


Lys 


Ala 


Arg 


Val 








180 










185 


Thr 


Trp 


Ser 


Asn 


Ala 


Arg 


Gly 


Glu 


Gly 






195 










200 




Met 


Asp 


Glu 


Ser 


Gly 


Glu 


He 


Arg 


Ala 




210 










215 






Asp 


Lys 


Phe 


Tyr 


Asp 


Leu 


He 


Gin 


Val 


225 










230 








Lys 


Cys 


Gin 


Leu 


Lys 


Pro 


Ala 


Asn 


Lys 










245 










Ala 


Tyr 


Glu 


Met 


Thr 


Phe 


Ser 


Gly 


Glu 








260 










265 


Asp 


Thr 


Asp 


Asp 


Asp 


Pro 


He 


Pro 


Glu 






275 










280 




He 


Ser 


Asp 


Val 


Ser 


Gly 


Met 


Glu 


Asn 




290 










295 






Gly 


He 


Cys 


Lys 


Glu 


Val 


Gly 


Glu 


Leu 
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540 










Asn 


Glu 


Gin 


Ala 


Phe 


Glu 


Glu 




555 










560 


Phe 


He 


Phe 


Arg 


Vai 


Arg 


Val 


570 










575 




Arg 


He 


Lys 


Ala 


Thr 


Vai 


Met 










590 






Tyr 


Gly 


Arg 


Arg 


Leu 


Val 


Met 








605 








vai 


lie 


Ala 


Arg 


lie 


Met 


His 


10 










15 




Vjin 


lie 


Leu 


Ala 


lie 


Lys 


Lys 










30 






Tyr 


Arg 


lie 


Leu 


lie 


Ser 


Asp 








4 5 








Leu 


ai a 


Ser 


Gin 


Leu 


Asn 


Val 






60 










n u _ 

rne 


Thr 


He 


Val 


Gin 


Leu 


Asp 




/ 0 










80 


Asp 


Gly 


Ala 


Gly 


Lys 


Arg 


Val 












95 




As n 


Pro 


biy 


Ala 


CjIU 


val 


Lys 










110 






bill 


Asn 


Ala 


Ala 


Lys 


Gin 


Asp 








125 








Asn 


Ser 


Lys 


Pro 


He 


Ala 


Lys 






140 










Asn 


He 


Val 


Met 


Asn 


Ser 


Ser 




155 










160 


Ser 


Ser 


Leu 


Ser 


Pro 


Tyr 


Gin 


170 










175 




Thr 


Ser 


Lys 


Ser 


Gly 


lie 


Arg 










190 






Lys 


Leu 


Phe 


Ser 


Met 


Asp 


Leu 








205 








Thr 


Ala 


Phe 


Lys 


Glu 


Gin 


Cys 






220 










AS p 


oer 


vai 


Tyr 


Tyr 


lie 


Ser 




235 










240 


Gin 


Tyr 


Ser 


Ser 


Leu 


Asn 


Asn 


250 










255 




Thr 


Val 


Val 


Gin 


Leu 


Cys 


Giu 










270 






He 


Lys 


Tyr 


Asn 


Leu 


Val 


Pro 








285 








Lys 


Ala 


Ala 


Vai 


Asp 


Thr 


He 






300 










Gin 


Ser 


Phe 


Vai 


Ala 


Arg 


Thr 
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305 










310 










315 










320 


Thr 


Asn 


Lys 


Glu 


Phe 


Lys 


Lys 


Arg 


Asd 


He 


Thr 


Leu 


Val 


Asp 


Met 


Ser 










325 










330 










335 




Asn 


Ser 


Ala 


He 


Ser 


Leu 


Thr 


Leu 


Trp 


Gly Asp 


Asp Ala 


Val 


Asn 


Phe 








340 










345 










350 






Asp 


Gly 


His 


Val 


Gin 


Pro 


Val 


He 


Leu 


Val 


Lys 


Gly Thr Arg 


lie 


Asn 






355 










360 










365 








Glu 


Phe 


Asn 


Gly 


Gly 


Lys 


Ser 


Leu 


Ser 


Leu 


Gly 


Gly 


Gly 


Ser 


lie 


Met 




370 










375 










380 










Lys 


He 


Asn 


Pro 


Asp 


lie 


Pro 


Glu 


Ala 


His 


Lys 


Leu 


Arg 


Gly Trp 


Phe 


385 










390 










395 










400 


Asp 


Asn 


Gly 


Gly 


Gly 


Asp 


Ser 


Val 


Ala 


Asn 


Met 


Val 


Ser 


Ala 


Arg 


Thr 










405 










410 










415 




Gly 


Gly 


Gly 


Ser 


Phe 


Ser 


Thr 


Glu 


Trp 


Met 


Thr 


Leu 


Lys 


Asp 


Ala 


Arq 








420 










425 










4 30 






Ala 


Arq 


As n 


Leu 


Glv 
j 


Ser 


Glv 

Jr 


Asp 


Lys 


Pro 


Asd 


Tyr 


Phe 


Gin 


Cvs 


Lvs 






435 










440 










445 








Ala 


Val 


Val 


His 


He 


Val 


Lys 

Jr 


Gin 


Glu 


Asn 


Ala 


Phe 


Tyr 


Arg 


Ala 


Cvs 

^ Jr ° 




450 










455 










460 










Pro 


Gin 


Ser 


Asp 


Cvs 


Asn 


Lvs 


Lys 


Val 


Val 


Asp 


Glu 


Gly Asn 


Asp 


Gin 


465 










470 










475 










480 


Phe 


Arq 


Cvs 


Glu 


Lys 


Cys 


Asn 


Ala 


Leu 


Phe 


Pro 


Asn 


Phe 


Lys 


Tyr 


Arq 










485 










490 










495 




Leu 


Leu 


He 


Asn 


Met 


Ser 


He 


Gly Asp 


Trp 


Thr 


Ser 


Asn 


Arg 


Trp 


Val 








500 










505 










510 






Ser 


Ser 


Phe 


Asn 


Glu 


Val 


Gly 


Glu 


Gin 


Leu 


Leu 


Gly His 


Thr 


Ser 


Gin 






515 










520 










525 








Glu 


Val 


Gly 


Glu 


Ala 


Leu 


Glu 


Asn 


Asp 


Pro 


Ala 


Lys 


Ala 


Glu 


Gin 


lie 




530 










535 










540 










Phe 


Ser 


Ala 


Leu 


Asn 


Phe 


Thr 


Ser 


His 


He 


Phe 


Lys 


Leu 


Arg 


Cys 


Lys 


545 










550 










555 










560 


Asn 


Glu 


Val 


Tyr 


Gly Asp 


Met 


Thr 


Arg 


Asn 


Lys 


Leu 


Thr 


Val 


Gin 


Ser 










565 










570 










575 




Val 


Ala 


Pro 


He 


Asn 


His 


Lys 


Glu 


Tyr 


Asn 


Lys 


His 


Leu 


Leu 


Lys 


Glu 








580 










585 










590 






Leu 


Gin 


Glu 


Leu 


Thr 


Gly 


lie 


Gly 


Ser 


Ser 


As n 













595 600 



<210> 9 
<211> 609 
<212> PRT 

<213> Schizosaccharomyces pombe 



<400> 9 

Met Ala Glu Arg Leu Ser Val Gly 

1 5 
Asp Ala Ser Ser Phe Pro Pro Asn 
20 

Lys Glu Leu Asn Ser Asn Pro Thr 

35 40 
Val Val Leu Ser Asp Ser lie Asn 

50 * 55 

Gin Leu Asn His Leu Val Ala Glu 
65 70 
Val Gin Leu Thr Gin Phe Thr Val 
85 

Leu lie Val Leu Gly Leu Asn Val 



Ala Leu Arg lie lie Asn Thr Ser 

10 " 15 
Pro lie Leu Gin Val Leu Thr Val 
25 30 
Ser Gly Ala Pro Lys Arg Tyr Arg 
45 

Tyr Ala Gin Ser Met Leu Ser Thr 
60 

Asn Lys Leu Gin Lys Gly Ala Phe 

75 ' 80 

Asn Val Met Lys Glu Arg Lys lie 

90 95 
Leu Thr Glu Leu Gly Val Met Asp 
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100 105 



Lys 


He 


Gly 


Asn 


Pro 


Ala 


Gly 


Leu 


Glu 






115 










120 




Gin 


Gin 


Asn 


Glu 


Gin 


Asn 


Asn 


Aia 


Ser 




130 










135 






Thr 


Ser 


Thr 


Asn 


Ser 


Phe 


Tyr 


Gly 


Asn 


145 










150 








Ala 


Pro 


Pro 


Pro 


Met 


Met 


Lys 


Lys 


Pro 










165 










Thr 


He 


He 


T vr 


Pro 


lie 


Glu 


Gly 


Leu 








180 










185 


Thr 


lie 


Arg 


Ala 


Arg 


Val 


Thr 


Asn 


Lys 






195 










200 




Asn 


Gin 


Arg 


Gly 


Glu 


Gly 


Lys 


Leu 


Phe 




210 










215 






Ser 


Gly 


Glu 


He 


Arg 


Ala 


Thr 


Gly 


Phe 


225 










230 






Tyr Asp 


lie 


Leu 


Gin 


Glu 


Gly 


Ser 


Val 










*1 A C 

24 o 










Val 


Asn 


He 


Ala 


Lys 


Lys 


Gin 


Tyr 


Thr 








O C A 










265 


Leu 


Met 


Phe 


Glu 


Arg 


Asp 


Thr 


Glu 


lie 






275 










280 




Ala 


Val 


Pro 


Val 


Ala 


Lys 


Phe 


Ser 


Phe 




290 










295 






Asp 


Val 


Ala 


Lys 


Asp 


Aia 


Val 


He 


Asp 


305 










310 








Val 


Gly 


Pro 


Val 


Gin 


Gin 


He 


Thr 


Ser 










325 










Asp 


Lys 


Arg 


Asp 


He 


Thr 


He 


Val 


Asp 








340 










345 


Val 


Thr 


Leu 


Trp 


Gly 


Lys 


Thr 


Aia 


He 






355 










360 




Ser 


He 


Leu 


Ala 


Phe 


Lys 


Gly 


Val 


Lys 




370 










375 






Ser 


Leu 


Ser 


Met 


Leu 


Thr 


Ser 


Ser 


Thr 


385 










390 








Gin 


Glu 


Ser 


His 


Leu 


Leu 


Lys 


Gly 


Trp 










405 










Gin 


Glu 


Phe 


Ala 


Lys 


His 


Ser 


Val 


lie 








420 










425 


Gly Arg 


Ser 


Ala 


Glu 


Arg 


Lys 


Asn 


He 






4 J j 










440 




Leu 


Gly 


Met 


Ser 


Glu 


Thr 


Pro 


Asp 


Tyr 




450 










A C C 

4 55 






Val 


Tyr 


He 


Arg 


Lys 


Lys 


Asn 


Val 


Ser 


465 










470 








Asp 


Cys 


Asn 


Lys 


Lys 


Val 


Phe 


Asp 


Gin 










485 










Lys 


Cys 


Asn 


Lys 


Glu 


Tyr 


Asp 


Aia 


Pro 








500 










505 


He 


Ala 


Val 


Gly 


Asp 


His 


Thr 


Gly 


Gin 






515 










520 




Asp 


Val 


Gly 


Lys 


Leu 


He 


Met 


His 


Lys 




530 










535 






Leu 


Gin 


Giu 


Asn 


Asp 


Glu 


Asn 


Aia 


Phe 



545 550 



PCT/US99/21277 



110 



Thr 


Val 


Asp 


Ala 


Leu 


Arg 


Gin 








125 








Ala 


Pro 


Arg 


Thr 


Gly 


He 


Ser 






140 










Asn 


Ala 


Ala 


Aia 


Thr 


Ala 


Pro 




155 










160 


Ala 


Ala 


Pro 


Asn 


Ser 


Leu 


Ser 


170 










175 




Ser 


Pro 


Tyr 


Gin 


Asn 


Lys 


Trp 










190 






Ser 


Glu 


Val 


Lys 


His 


Trp 


His 








205 








Ser 


Val 


Asn 


Leu 


Leu 


Asp 


Glu 






220 








Asn 


Asp 


Gin 


Val 


Asp 


Aia 


Phe 




235 










240 


Tyr 


Tyr 


He 


Ser 


Arg 


Cys 


Arg 


250 










255 




Asn 


Val 


Gin 


Asn 


Glu 


Tyr 


Giu 










270 






Arg 


Lys 


Ala 


Glu 


Asp 


Gin 


Thr 








285 








Val 


Ser 


Leu 


Gin 


Glu 


Val 


Gly 






300 








Vai 


He 


Gly 


Val 


Leu 


Gin 


Asn 




315 










320 


Arg 


Ala 


Thr 


Ser 


Arg 


Giy 


Phe 


330 










335 




Gin 


Thr 


Gly 


Tyr 


Giu 


Met 


Arg 










350 






Glu 


Phe 


Ser 


Val 


Ser 


Glu 


Glu 








365 








Val 


Asn 


Asp 


Phe 


Gin 


Gly 


Arg 






380 










Met 


Ser 


Val 


Asp 


Pro 


Asp 


He 




395 










400 


Tyr 


Asp 


Gly 


Gin 


Giy 


Arg 


Giy 


410 










415 




Ser 


Ser 


Thr 


Leu 


Ser 


Thr 


Thr 










430 






Ala 


Glu 


Val 


Gin 


Aia 


Glu 


His 








445 








Phe 


Ser 


Leu 


Lys 


Gly 


Thr 


He 






460 










Tyr 


Pro 


Ala 


Cys 


Pro 


Aia 


Ala 




475 










480 


Gly 


Giy 


Ser 


Trp 


Arg 


Cys 


Glu 


490 










495 




Gin 


Tyr 


Arg 


Tyr 


lie 


lie 


Thr 










510 






Leu 


Trp 


Leu 


Asn 


Val 


Phe 


Asp 








525 








Thr 


Ala 


Asp 


Glu 


Leu 


Asn 


Asp 






540 










Met 


Asn 


Cys 


Met 


Aia 


Glu 


Aia 




555 










560 
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Cys Tyr Met Pro 

Lys Gly Glu Met 
580 

Asp Trp Lys Glu 
595 

Gin 



Tyr He Phe Gin 
565 

Arg Val Arg Tyr 

Glu Ser Lys Arg 
600 



Cys Arg Ala Lys 
570 

Thr Val Met Ser 
585 

Leu He Asn Phe 



Gin Asp Asn Phe 
575 

He Asn Gin Met 
590 

He Glu Ser Ala 
605 



<210> 10 
<211> 621 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 10 

Met Ser Ser Val Gin Leu Ser Arg Gly Asp Phe His Ser He Phe Thr 

1 5 10* 15 

Asn Lys Gin Arg Tyr Asp Asn Pro Thr Gly Gly Val Tyr Gin Val Tyr 

2 0 25 ~ * 30 

Asn Thr Arg Lys Ser Asp Gly Ala Asn Ser Asn Arg Lys Asn Leu lie 

35 40 45 

Met He Ser Asp Gly He Tyr His Met Lys Ala Leu Leu Arg Asn Gin 

50 55 60 

Ala Ala Ser Lys Phe Gin Ser Met Glu Leu Gin Arg Gly Asp He He 
65 70 75 ' * 80 

Arg Val He lie Ala Glu Pro Ala He Val Arg Glu Arg Lys Lys Tyr 

85 90 95 

Val Leu Leu Val Asp Asp Phe Glu Leu Val Gin Ser Arg Ala Asp Met 

100 105 110 

Val Asn Gin Thr Ser Thr Phe Leu Asp Asn Tyr Phe Ser Glu His Pro 

115 120 " 125 

Asn Glu Thr Leu Lys Asp Glu Asp He Thr Asp Ser Gly Asn Val Ala 

130 135 140 

Asn Gin Thr Asn Ala Ser Asn Ala Gly Val Pro Asp Met Leu His Ser 
145 150 155 160 

Asn Ser Asn Leu Asn Ala Asn Glu Arg Lys Phe Ala Asn Glu Asn Pro 

165 170 175 

Asn Ser Gin Lys Thr Arg Pro lie Phe Ala He Glu Gin Leu Ser Pro 

180 185 190 

Tyr Gin Asn Val Trp Thr lie Lys Ala Arg Val Ser Tyr Lys Gly Glu 

195 200 205 

He Lys Thr Trp His Asn Gin Arg Gly Asp Gly Lys Leu Phe Asn Val 

210 215 220 

Asn Phe Leu Asp Thr Ser Gly Glu He Arg Ala Thr Ala Phe Asn Asp 
225 230 235 240 

Phe Ala Thr Lys Phe Asn Glu lie Leu Gin Glu Gly Lys Val Tyr Tyr 

245 250 * 255 

Val Ser Lys Ala Lys Leu Gin Pro Ala Lys Pro Gin Phe Thr Asn Leu 

260 265 270 

Thr His Pro Tyr Glu Leu Asn Leu Asp Arg Asp Thr Val lie Glu Glu 

275 ~ 280 285 

Cys Phe Asp Glu Ser Asn Val Pro Lys Thr His Phe Asn Phe lie Lys 

290 295 300 

Leu Asp Ala He Gin Asn Gin Glu Val Asn Ser Asn Val Asp Val Leu 
305 * 310 315 320 

Gly He lie Gin Thr He Asn Pro His Phe Glu Leu Thr Ser Arg Ala 
325 330 335 
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Gl v 


L vs 


Lys 


Phe 


Asd 




Arg 


As D 


1 1 p 

lie 


' ill 


T 1 A 

lie 


val 


Asp 


Asp 


Ser 


Gly 








340 










345 








•q c n 
J 0 u 




Phe 


Ser 


He 


Ser 


Val 


Gly 


Leu 


A im w 


As n 


ulu 


oin 


Ala 


Leu 


Asp 


Phe 


Asn 






355 










3 6*0 










"3 C ^ 
0 DO 






Leu 


Pro 


Glu 


Glv 


Ser 


Val 


Ai a 


Ala 


11C 


T \j <s 
liy cj 


ui y 


val 


Arg 


Val 


Thr 


Asp 




370 










375 










jOU 








Phe 


Gly 


Gly 


Lvs 


Ser 


Leu 


Ser 


Met 


Gly 


£ 1 lc 


c ^ r 

1 




l nr 


Leu 


lie 


Pro 


385 










390 










J? J 










4 00 


Asn 


Pro 


Glu 


lie 


Pro 


Glu 


Ala 


1 y ~ 


Ala 


Leu 


Lys 




Trp 


Tyr 


Asp 


Ser 










405 










*i i \j 








4 ±0 




Lys 


Gly 


Arq 


As n 
420 


Ala 


Asn 


Phe 


lie 


Thr 
425 


Leu 


T v<: 
jj y o 




wlU 


r 10 
430 


bly 


Met 


Gly 

J; 


Gly 


Gin 


Ser 


Ala 


Ala 


Ser 


Leu 


Thr 


Lys 


I; lie 


ne 


Ai a 


bin 


Arg 


lie 






435 










4 4 0 










A AC. 
4 4 0 






Thr 


He 


Ala 


Arg 


Ala 


Gin 


Ala 


ui Li 


As n 


Leu 


vai y 


Arg 


Ser 


Glu 


Lys 


Gly 




450 










455 










460 






Asd 


Phe 


Phe 


Ser 


Val 


Lys 


Ala 


Ala 


He 


Co )• 
JC1 


r ne 


Leu 


Lys 


val 


Asp 


Asn 


465 










470 










d 7 S 






480 


Phe 


Ala 


T v r 


Pro 


Ala 


Cys 


Ser 


A c n 
•r\*5 ii 


U1U 


As n 


Cys 


As n 


Lys 


Lys 


Val 


Leu 










485 
















495 




Glu 


Gin 


Pro 


Asd 


Gly Thr 






uys 


oiu 


.Lys 


k^ys 


Asp 


Tnr 


Asn 


Asn 








500 










505 














Ala 


A r* n 


Pro 
515 


As n 


Trp 


Arg 


T v r 


520 


Leu 


inr 


lie 


ber 


lie 
525 


lie 


Asp 


Glu 


Thr 


As n 


Gin 


Leu 


Trp 


Leu 


Thr 


Leu 




* ^P 


As p 


vjin 


Ala 


Lys 


Gin 


Leu 




530 










535 










540 








Leu 


Gly 


Val 


Asp 


Ala 


Asn 


Thr 


Leu 


Met 


Ser 


Leu 


Lys 


Glu 


Glu 


Asp 


Pro 


545 










550 










555 






560 


Asn 


Glu 


Phe 


Thr 


Lys 


He 


Thr 


Gin 


Ser 


lie 


Gin 


Met 


Asn 


Glu 


Tyr 


Asp 










565 










570 










575 


Phe 


Arg 


He 


Arg 


Ala 


Arg 


Glu 


Asp 


Thr 


Tyr 


Asn 


Asp 


Gin 


Ser 


Arg 


He 








580 










585 










590 




Arg 


Tyr 


Thr 


Val 


Ala 


Asn 


Leu 


His 


Ser 


Leu 


Asn 


Tyr 


Arg 


Ala 


Glu 


Aia 






595 










600 








605 








Asp 


Tyr 

610 


Leu 


Ala 


Asp 


Giu 


Leu 

615 


Ser 


Lys 


Ala 


Leu 


Leu 
620 


Ala 









<210> 11 
<211> 1124 
<212> DNA 
<213> Zea mays 

<220> 

<221> m±sc_feature 
<222> (0) ... (0) 

<223> Maize RPA Middle Subunit Homologue-1 

<221> CDS 

<222> (76) . . . (894) 

<400> 11 

tcgacccacg cgtccgatcc tcccatctgc gcacccgcaa gcctattcgc cgcacctcct 60 
caggtgaccg ggaag atg atg ccg ttg age caa acc gac ttc teg ccg teg 111 
Met Met Pro Leu Ser Gin Thr Asp Phe Ser Pro Ser 
15 10 

cag ttc acc tec tec cag aat gee gec gee gac tec acc acg cct tec 159 
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Gin Phe Thr Ser Ser Gin Asn Ala Ala Ala Asp Ser Thr Thr Pro Ser 
15 20 25 

aag atg cgc ggc gcg tec age acc atg ccg etc ace gtg aag cag gtc 207 
Lys Met Arg Gly Ala Ser Ser Thr Met Pro Leu Thr Val Lys Gin Val 
30 35 40 

gtc gac gcg cag cag tct ggc acg ggc gag aag ggc get ccg ttc ate 255 
Val Asp Ala Gin Gin Ser Gly Thr Gly Glu Lys Gly Ala Pro Phe He 
45 50 55 60 

gtc aat ggc gtc gag atg get aac att cga ctt gtg ggg atg gtc aat 303 
Val Asn Gly Val Glu Met Ala Asn lie Arg Leu Val Gly Met Val Asn 
65 70 75 

gec aag gtg gag egg acg acc gat gtg acc ttc acg etc gac gat ggc 351 
Ala Lys Val Glu Arg Thr Thr Asp Val Thr Phe Thr Leu Asp Asp Gly 
80 85 90 

acc ggc cgc etc gat ttc ate aga tgg gtg aat gat get tea gat tct 399 
Thr Gly Arg Leu Asp Phe He Arg Trp Val Asn Asp Ala Ser Asp Ser 
95 100 105 



ttt gaa act get get att cag aat ggt atg tac att gcg gtc att gga 
Phe Glu Thr Ala Ala He Gin Asn Gly Met Tyr He Ala Val He Gly 
110 115 120 



447 



age etc aag gga ctg caa gag agg aag cgt get act get ttc tea ate 495 
Ser Leu Lys Gly Leu Gin Glu Arg Lys Arg Ala Thr Ala Phe Ser He 
125 ^ 130 135 140 

agg cct ata acc gat ttc aat gag gtt acg ctg cat ttc att cag tgt 543 
Arg Pro lie Thr Asp Phe Asn Glu Val Thr Leu His Phe He Gin Cys 
145 150 155 

gtt egg atg cat ata gag aac att gaa tta aag get ggc agt cct gca 591 
Val Arg Met His He Glu Asn He Glu Leu Lys Ala Gly Ser Pro Ala 
160 165 170 

cga ate agt tct tct atg gga gtg tea ttc tea aat gga ttc agt gaa 639 
Arg He Ser Ser Ser Met Gly Val Ser Phe Ser Asn Gly Phe Ser Glu 
175 180 185 

tea age aca ccg aca tct ttg aaa tec agt ccc gca ccg gtg acc age 687 
Ser Ser Thr Pro Thr Ser Leu Lys Ser Ser Pro Ala Pro Val Thr Ser 
190 195 200 

ggg tea tec gat act gat ctg cac acg cag gtc ctg aat ttt ttt aat 735 
Gly Ser Ser Asp Thr Asp Leu His Thr Gin Val Leu Asn Phe Phe Asn 
205 210 215 220 

gaa cca gcg aac etc gag agt gag cat ggg gtg cac gtt gat gaa gta 783 
Glu Pro Ala Asn Leu Glu Ser Glu His Gly Val His Val Asp Glu Val 
225 230 235 



etc aag egg ttc aaa ctt ttg ccg aag aag cag ate acg gat get att 
Leu Lys Arg Phe Lys Leu Leu Pro Lys Lys Gin He Thr Asp Ala lie 



831 
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240 245 250 

gat tac aat atg gac teg ggg cgt czz tac tea aca att gat gaa tec 879 
Asp Tyr Asn Met Asp Ser Gly Arg Leu Tyr Ser Thr He Asp Glu Phe 
255 260 265 

cac tac aag gca act taaccgattt gaaggecage ctgctggaaa tggcagagga 934 
His Tyr Lys Ala Thr 
270 

ctaagtatca cttgtactaa accaaagtct ggaaatgtca tgttgtgtca tgaaatgeat 994 

ggttggttta tggaaacatt tatatcttgt atcaactagt tgatttgtat ctcgtgtcaa 1054 

ctcaatgact gagecaagaa aaggaagatg tagaggcega cagaaaaaaa aaaaaaaaaa 1114 

aaaaaaaaaa 1124 

<210> 12 
<211> 273 
<212> PRT 
<213> Zea mays 

<400> 12 

Met Met Pro Leu Ser Gin Thr Asp Phe Ser Pro Ser Gin Phe Thr Ser 

15 10 15 

Ser Gin Asn Ala Ala Ala Asp Ser Thr Thr Pro Ser Lys Met Arg Gly 

20 25 30 

Ala Ser Ser Thr Met Pro Leu Thr Val Lys Gin Val Val Asp Ala Gin 

35 40 45 

Gin Ser Gly Thr Gly Glu Lys Gly Ala Pro Phe He Val Asn Gly Val 

50 55 60 

Glu Met Ala Asn lie Arg Leu Val Gly Met Val Asn Ala Lys Val Glu 
65 70 75 80 

Arg Thr Thr Asp Val Thr Phe Thr Leu Asp Asp Gly Thr Gly Arg Leu 

85 90 95 

Asp Phe He Arg Trp Val Asn Asp Ala Ser Asp Ser Phe Glu Thr Ala 

100 105 110 

Ala He Gin Asn Gly Met Tyr lie Ala Val He Gly Ser Leu Lys Gly 

115 120 125 

Leu Gin Glu Arg Lys Arg Ala Thr Ala Phe Ser lie Arg Pro lie Thr 

130 135 140 

Asn Phe Asn Glu Val Thr Leu His Phe He Gin Cys Val Arg Met His 
145 150 155 160 

He Glu Asn He Glu Leu Lys Ala Gly Ser Pro Ala Arg He Ser Ser 

165 17 0 17 5 

Ser Met Gly Val Ser Phe Ser Asn Gly Phe Ser Glu Ser Ser Thr Pro 

180 185 190 

Thr Ser Leu Lys Ser Ser Pro Ala Pro Val Thr Ser Gly Ser Ser Asp 

195 200 205 

Thr Asp Leu His Thr Gin Val Leu Asn Phe Phe Asn Glu Pro Ala Asn 

210 215 220 

Leu Glu Ser Glu His Gly Val His Val Asp Glu Val Leu Lys Arg Phe 
225 230 235 240 

Lys Leu Leu Pro Lys Lys Gin lie Thr Asp Ala He Asp Tyr Asn Met 

245 * 250 255 

Asp Ser Gly Arg Leu Tyr Ser Thr lie Asp Glu Phe His Tyr Lys Ala 
260 265 270 

Thr 
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<210> 13 

<211> 979 

<212> DNA 

<213> Zea mays 

<220> 

<221> misc_f eature 
<222> (0) ... (0) 

<223> Maize RPA Middle Subunit Homologue-2 and 3 

<22i> CDS 

<222> (37) . . . (855) 

<400> 13 

ttcggcacga gcgcaccrcc tcaggtgacc gggaag atg atg ccg ttg age caa 54 

Met Met Pro Leu Ser Gin 

1 5 

acc gac ttc teg ccg teg cag ttc acc tec tec cag aat gee gee gec 102 

Thr Asp Phe Ser Pro Ser Gin Phe Thr Ser Ser Gin Asn Ala Ala Ala 
10 15 20 

gac tec acc acg cct tec aag atg cgc ggc gcg tec age acc atg ccg 150 
Asp Ser Thr Thr Pro Ser Lys Met Arg Gly Ala Ser Ser Thr Met Pro 
25 30 ' ' 35 

etc acc gtg aag cag gtc gtc gac gcg cag cag tct ggc acg ggc gac 198 
Leu Thr Val Lys Gin Val Val Asp Ala Gin Gin Ser Gly Thr Gly Asp 
40 45 50 

aag ggc get ccg ttc ate gtc aat ggc gtc gag atg get aac att cga 246 
Lys Gly Ala Pro Phe lie Val Asn Gly Val Glu Met Ala Asn lie Arg 
55 60 65 70 

ett gtg ggg atg gtc aat gec aag gtg gag egg acg acc gat gtg acc 294 
Leu Val Gly Met Val Asn Ala Lys Val Glu Arg Thr Thr Asp Val Thr 
75 80 85 

ttc acg etc gac gat ggc acc ggc cgc etc gat ttc ate aga tgg gtg 342 
Phe Thr Leu Asp Asp Gly Thr Gly Arg Leu Asp Phe lie Arg Trp Val 
90 95 100 

aat gat get tea gat tct ttt gaa act get get att cag aat ggt atg 390 
Asn Asp Ala Ser Asp Ser Phe Glu Thr Ala Ala lie Gin Asn Gly Met 
105 110 115 

tac att gcg gtc att gga age etc aag gga ctg caa gag agg aag cgt 438 
Tyr lie Ala Val lie Gly Ser Leu Lys Gly Leu Gin Glu Arg Lys Arg 
120 125 130 

get act get ttc tea ate agg cct ata acc gat ttc aat gag gtt acg 486 
Ala Thr Ala Phe Ser lie Arg Pro lie Thr Asp Phe Asn Glu Val Thr 
135 140 145 150 

ctg cat ttc att cag tgt gtt egg atg cat ata gag aac att gaa tta 534 
Leu His Phe lie Gin Cys Val Arg Met His lie Glu Asn He Glu Leu 
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1S5 160 165 

aag get: ggc agt cct gca cga ate agt tct tct atg gga gtg tea ttc 582 
Lys Ala Gly Ser Pro Ala Arg He Ser Ser Ser Met Gly Val Ser Phe 
1*70 175 180 

tea aat gga tte act gaa tea age aca ccg aca tct ttg aaa tec agt 630 
Ser Asn Gly Phe Ser Glu Ser Ser Thr Pro Thr Ser Leu Lys Ser Ser 
185 190 195 

ecc gca ccg gtg ace age ggg tea tec gat act gat ctg cac acg cag 678 
Pro Ala Pro Val Thr Ser Gly Ser Ser Asd Thr Asp Leu His Thr Gin 
200 205 210 

gtc ctg aat ttt ttt aat gaa cca gcg aac etc gag agt gag cat ggg 726 
Val Leu Asn Phe Phe Asn Glu Pro Ala Asn Leu Glu Ser Glu His Gly 
215 220 225 230 

gtg cac gtt gat gaa gta etc aag egg ttc aaa ctt ttg ccg aag aag 774 
Val His Val Asp Glu Val Leu Lys Arg Phe Lys Leu Leu Pro Lys Lys 
235 240 245 

cag ate acg gat get att gat tac aat atg gac teg ggg cgt ctt tac 822 
Gin He Thr Asp Ala He Asp Tyr Asn Met Asd Ser Gly Arg Leu Tyr 
250 255 260 

tea aca att gat gaa ttc cac tac aag gca act taaccgattt gaaggecage 875 
Ser Thr He Asp Glu Phe His Tyr Lys Ala Thr 
265 270 

ctgctggaaa tggcagagga ctaagtatca cttgtactaa accaaagtct ggaaatgtca 935 
tgttgtgtca tgaaatgeat ggttggttta tggaaacaaa aaaa 979 

<210> 14 
<211> 273 
<212> PRT 
<213> Zea mays 

<400> 14 

Met Met Pro Leu Ser Gin Thr Asp Phe Ser Pro Ser Gin Phe Thr Ser 

15 10 15 

Ser Gin Asn Ala Ala Ala Asp Ser Thr Thr Pro Ser Lvs Met Arg Gly 

20 25 " 30 

Ala Ser Ser Thr Met Pro Leu Thr Val Lys Gin Val Val Asd Ala Gin 

35 40 * 45 

Gin Ser Gly Thr Gly Asp Lys Gly Ala Pro Phe He Val Asn Gly Val 

50 55 60 

Glu Met Ala Asn He Arg Leu Val Gly Met Val Asn Ala Lys Val Glu 
65 70 75 80 

Arg Thr Thr Asp Val Thr Phe Thr Leu Asp Asp Gly Thr Gly Arg Leu 

85 90 ^ 95 

Asp Phe He Arg Trp Val Asn Asp Ala Ser Asp Ser Phe Glu Thr Ala 

100 105 110 

Ala He Gin Asn Gly Met Tyr He Ala Val He Gly Ser Leu Lys Gly 

115 120 125 

Leu Gin Glu Arg Lys Arg Ala Thr Ala Phe Ser He Arg Pro He Thr 
130 135 140 



23 



WO 00/15816 



PCT/US99/21277 



Asp 


Phe 


Asn 


Glu 


Val 


Thr 


Leu 


His 


Phe 


He 


Gin 


Cys 


Val 


Arg 


Met 


His 


145 










150 










155 










160 


lie 


Glu 


Asn 


lie 


Glu 


Leu 


Lys 


Ala 


Gly 


Ser 


Pro 


Ala 


Arg 


He 


Ser 


Ser 










165 










170 










175 




Ser 


Met 


Gly 


Val 


Ser 


Phe 


Ser 


Asn 


Gly 


Phe 


Ser 


Glu 


Ser 


Ser 


Thr 


Pro 








180 










185 










190 






Thr 


Ser 


Leu 


Lys 


Ser 


Ser 


Pro 


Ala 


Pro 


Val 


Thr 


Ser 


Gly 


Ser 


Ser 


Asp 






195 










200 










205 








Thr 


Asp 


Leu 


His 


Thr 


Gin 


Val 


Leu 


Asn 


Phe 


Phe 


Asn 


Glu 


Pro 


Ala 


Asn 




210 










215 










220 










Leu 


Glu 


Ser 


Glu 


His 


Gly 


Val 


His 


Val 


Asp 


Glu 


Val 


Leu 


Lys 


Arg 


Phe 


225 










230 










235 










240 


Lys 


Leu 


Leu 


Pro 


Lys 


Lys 


Gin 


He 


Thr 


Asp 


Ala 


He 


Asp 


Tyr 


Asn 


Met 










245 










250 










255 




Asp 


Ser 


Gly Arg 


Leu 


Tyr 


Ser 


Thr 


He 


Asp 


Glu 


Phe 


His 


Tyr 


Lys 


Ala 








260 










265 










270 







Thr 



<210> 15 

<211> 1051 

<212> DNA 

<213> Zea mays 

<220> 

<221> misc_feature 
<222> (0) ... (0) 

<223> Maize RPA Middle Subunit Homologue-4 

<221> CDS 

<222> (76) . . . (894) 
<400> 15 

tcgacccacg cgtccgatcc tcccatctgc gcacccgcaa gcctattcgc cgcacctcct 60 
caggtgaccg ggaag atg atg ccg ttg age caa acc gac ttc teg ccg teg ill 

Met Met Pro Leu Ser Gin Thr Asp Phe Ser Pro Ser 
1 5 10 

cag ttc acc tec tec cag aat gee gec gee gac tec acc acg cct tec 159 
Gin Phe Thr Ser Ser Gin Asn Ala Ala Ala Asp Ser Thr Thr Pro Ser 
15 20 25 

aag atg cgc ggc gcg tec age acc atg ccg etc acc gtg aag cag gtc 207 
Lys Met Arg Gly Ala Ser Ser Thr Met Pro Leu Thr Val Lys Gin Val 
30 * 35 40 

gtc gac gcg cag cag tct ggc acg ggc gag aag ggc get ccg ttc ate 255 
Val Asp Ala Gin Gin Ser Gly Thr Gly Glu Lys Gly Ala Pro Phe He 
45 50 55 60 

gtc aat ggc gtc gag atg get aac att cga ctt gtg ggg atg gtc aat 303 
Val Asn Gly Val Glu Met Ala Asn He Arg Leu Val Gly Met Val Asn 
65 70 75 

gee aag gtg gag egg acg acc gat gtg acc ttc acg etc gac gat ggc 351 
Ala Lys Val Glu Arg Thr Thr Asp Val Thr Phe Thr Leu Asp Asp Gly 
80 85 90 
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acc ggc cgc etc gat ttc ate aga tgg gtg aat gat get tea gat tct 399 

Thr Gly Arg Leu Asp Phe lie Arg Trp Val Asn Asd Ala Ser Asp Ser 
95 100 * 105 

ttt gaa act get get att cag aat ggt atg tac att gcg gtc att gga 447 

Phe Glu Thr Ala Ala lie Gin Asn Gly Met Tyr lie Ala Val lie Glv 
110 115 120 

age etc aag gga ctg caa gag agg aag cgt get act get ttc tea ate 495 

Ser Leu Lys Gly Leu Gin Glu Arg Lys Arg Ala Thr Ala Phe Ser He 

125 130 135 140 

agg cct ata acc gat ttc aat gag gtt acg ctg cat ttc att cag tgt 543 

Arg Pro He Thr Asp Phe Asn Glu Val Thr Leu His Phe He Gin Cys 
145 150 155 

gtt egg atg cat ata gag aac act gaa tta aag get ggc agt cct gca 591 

Val Arg Met His He Glu Asn Thr Glu Leu Lys Ala Gly Ser Pro Ala 
160 165 " 170 

cga ate aat tct tct atg gga gtg tea ttc tea aat gga ttc agt gaa 639 

Arg He Asn Ser Ser Met Gly Val Ser Phe Ser Asn Gly Phe Ser Glu 
175 180 185 

tea age aca ccg aca tct ttg aaa tec agt ccc gca ccg gtg acc age 687 

Ser Ser Thr Pro Thr Ser Leu Lys Ser Ser Pro Ala Pro Val Thr Ser 
190 195 200 

ggg tea tec gat act gat ctg cac acg cag gtc ctg aat ttt ttt aat 735 

Gly Ser Ser Asp Thr Asp Leu His Thr Gin Val Leu Asn Phe Phe Asn 

205 210 215 220 

gaa cca gcg aac etc gag agt gag cat ggg gtg cac gtt gat gaa gta 783 

Glu Pro Ala Asn Leu Glu Ser Glu His Gly Val His Val Asp Glu Val 

225 220 ' 235 

etc aag egg ttc aaa ctt ttg ccg aag aag cag ate acg gat get att 331 

Leu Lys Arg Phe Lys Leu Leu Pro Lys Lys Gin lie Thr Asp Ala He 
240 245 250 

gat tac aat atg gac teg ggg cgt ctt tac tea aca att gat gaa ttc 879 

Asp Tyr Asn Met Asp Ser Gly Arg Leu Tyr Ser Thr lie Asp Glu Phe 
255 260 265 

cac tac aag gca act taaccgattt gaaggtcagc ctgctggaaa tggcagagga 934 
His Tyr Lys Ala Thr 
27 0 

ctaagtatca cttgtactaa accaaagtct ggaaatgtca tgttgtgtca tgaaatgeat 994 
ggttggttta tggaaacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 1051 

<210> 16 
<211> 273 
<212> PRT 
<213> Zea mays 
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<400> 16 



Met 


Met 


Pro 


Leu 


Ser 


Gin Thr Asp 


Phe 


Ser 


Pro 


Ser 


Gin 


Phe 


Thr 


Ser 


1 








5 










10 










15 




Ser 


Gin 


Asn 


Ala 


Ala 


Ala 


Asp 


Ser 


Thr 


Thr 


Pro 


Ser 


Lys 


Met 


Arg 


Gly 








20 










25 










30 






Ala 


Ser 


Ser 


Thr 


Met 


Pro 


Leu 


Thr 


Val 


Lys 


Gin 


Val 


Val 


Asp 


Ala 


Gin 






35 










40 










45 








Gin 


Ser 


Gly 


Thr 


Gly 


Glu 


Lys 


Gly 


Ala 


Pro 


Phe 


He 


Val 


Asn 


Gly Val 




50 










55 










60 










Glu 


Met 


Ala 


Asn 


He 


Arg 


Leu 


Val 


Gly Met 


Val 


Asn 


Ala 


Lys 


Val 


Glu 


65 










70 










75 








80 


Arg 


Thr 


Thr 


Asp 


Val 


Thr 


Phe 


Thr 


Leu Asp 


Asp 


Gly Thr 


Gly Arg 


Leu 










85 










90 










95 




Asp 


Phe 


He 


Arg 


Trp 


Val Asn Asp 


Ala 


Ser 


Asp 


Ser 


Phe 


Glu 


Thr 


Ala 








100 










105 










110 






Ala 


lie 


Gin 


Asn 


Gly 


Met 


Tyr 


He 


Ala 


Val 


He 


Gly 


Ser 


Leu 


Lys 


Gly 






115 










120 










125 








Leu 


Gin 


Glu 


Arg 


Lys 


Arg Ala Thr 


Ala 


Phe 


Ser 


He 


Arg 


Pro 


He 


Thr 




130 










135 










140 










Asp 


Phe 


Asn 


Glu 


Val 


Thr 


Leu 


His 


Phe 


He 


Gin 


Cys 


Val 


Arg 


Met 


His 


145 










150 










155 










160 


lie 


Glu 


Asn 


Thr 


Glu 


Leu 


Lys 


Ala 


Gly 


Ser 


Pro 


Ala 


Arg 


He 


Asn 


Ser 










165 










170 










175 




Ser 


Met 


Gly Val 


Ser 


Phe 


Ser 


Asn 


Gly 


Phe 


Ser 


Glu 


Ser 


Ser 


Thr 


Pro 








180 










185 










190 






Thr 


Ser 


Leu 


Lys 


Ser 


Ser 


Pro 


Ala 


Pro 


Val 


Thr 


Ser 


Gly 


Ser 


Ser 


Asp 






195 










200 










205 








Thr 


Asp 


Leu 


His 


Thr 


Gin 


Val 


Leu 


Asn 


Phe 


Phe 


Asn 


Glu 


Pro 


Ala 


Asn 




210 










215 










220 










Leu 


Glu 


Ser 


Glu 


His 


Gly Val 


His 


Val 


Asp 


Glu 


Val 


Leu 


Lys 


Arg 


Phe 


225 










230 










235 










240 


Lys 


Leu 


Leu 


Pro 


Lys 


Lys 


Gin 


He 


Thr 


Asp 


Ala 


He 


Asp 


Tyr 


Asn 


Met 










245 










250 










255 




Asp 


Ser 


Gly Arg 


Leu 


Tyr 


Ser 


Thr 


He 


Asp 


Glu 


Phe 


His 


Tyr 


Lys 


Ala 








260 










265 










270 







Thr 



<210> 17 
<211> 1087 
<212> DNA 
<213> Zea mays 

<220> 

<221> misc_feature 
<222> (0) . . . (0) 

<223> Maize RPA Middle Subunit Homologue-5 
<221> CDS 

<222> (91) . . . (1044) 
<400> 17 

aattccgggg ccgacccacg cgtccgcatc gatcctccca tctgcgcacc cgcaagccta 60 
ttcgccgcac ctcctcaggt gaccgggaag atg atg ccg ttg age caa acc gac 114 

Met Met Pro Leu Ser Gin Thr Asp 
1 5 
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ttc teg ccg teg cag ttc acc tec tec cag aac gec gee gee gac tec 162 
Phe Ser Pro Ser Gin Phe Thr Ser Ser Gin Asn Ala Ala Ala Asp Ser 
10 15 20 

acc acg cct tec aag atg cgc ggc gcg tec age acc atg ccg etc acc 210 
Thr Thr Pro Ser Lys Met Arg Gly Ala Ser Ser Thr Met Pro Leu Thr 
25 30 35 40 

gtg aag car gtc gtc gac gcg cag cag tct ggc acg ggc gag aag ggc 258 
Val Lys Xaa Val Val Asp Ala Gin Gin Ser Gly Thr Gly Glu Lvs Gly 
45 50 55 

get ccg ttc ate gtc aat ggc gtc gag atg get aac att cga ctt gtg 306 
Ala Pro Phe lie Val Asn Gly Val Glu Met Ala Asn lie Arg Leu Val 
60 65 70 

ggg atg gtc aat gec aag gtg gag egg acg acc gat gtg acc ttc acg 354 
Gly Met Val Asn Ala Lys Val Glu Arg Thr Thr Asp Val Thr Phe Thr 
75 80 85 

etc gac gat ggc acc ggc cgc etc gat ttc ate aga tgg gtg aat gat 402 
Leu Asp Asd Gly Thr Gly Arg Leu Asp Phe lie Arg Trp Val Asn Asp 
90 " 95 100 

get tea gat tct ttt gaa act get get att cag aat ggt atg tac att 450 
Ala Ser Asp Ser Phe Glu Thr Ala Ala lie Gin Asn Gly Met Tyr lie 
105 110 115 120 



gcg gtc att gga age etc aag gga ctg caa gag agg aag cgt get act 
Ala Val lie Gly Ser Leu Lys Gly Leu Gin Glu Arg Lys Arg Ala Thr 
125 130 135 



ccg gtg acc age ggg tea tec gat act gat ctg cac acg cag gtc ctg 
Pro Val Thr Ser Gly Ser Ser Asp Thr Asp Leu His Thr Gin Val Leu 
205 210 215 

aat ttt ttt aat gaa cca gcg aac etc gag agt gag cat ggg gtg cac 
Asn Phe Phe Asn Glu Pro Ala Asn Leu Glu Ser Glu His Gly Val His 
220 225 230 

gtt gat gaa gta etc aag egg ttc aac ttt tgc cga aga age aga tea 



498 



get ttc tea ate agg cct ata acc gat ttc aat gag gtt acg ctg cat 546 
Ala Phe Ser lie Arg Pro lie Thr Asp Phe Asn Glu Val Thr Leu His 
140 145 150 

ttc att cag tgt gtt egg atg cat ata gag aac act gaa tta aag get 594 
Phe lie Gin Cys Val Arg Met His lie Glu Asn Thr Glu Leu Lys Ala 
155 ' 160 165 

ggc agt cct gca cga ate aat tct tct atg gga gtg tea ttc tea aat 642 
Gly Ser Pro Ala Arg lie Asn Ser Ser Met Gly Val Ser Phe Ser Asn 
170 175 180 

gga ttc agt gaa tea age aca ccg aca tct ttg aaa tec agt cce gca 690 
Gly Phe Ser Glu Ser Ser Thr Pro Thr Ser Leu Lys Ser Ser Pro Ala 
185 190 195 200 



738 



786 



834 
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Val Asp Glu Val Leu Lys Arg Phe Asn Phe Cys Arg Arg Ser Arg Ser 
235 240 " 245 

egg atg eta ttg att aca ata tgg act egg ggc gtc ttt act caa caa 882 
Arg Met Leu Leu He Thr He Trp Thr Arg Gly Val Phe Thr Gin Gin 
250 255 260 

ttg atg aat tec act aca agg caa ctt aac cga ttt gaa ggt cag cct 930 
Leu Met Asn Ser Thr Thr Arg Gin Leu Asn Arg Phe Glu Gly Gin Pro 
265 270 275 280 

get gga aat ggc aga gga eta agt ate act tgt act aaa cca aag tct 978 
Ala Gly Asn Gly Arg Gly Leu Ser He Thr Cys Thr Lys Pro Lys Ser 
285 290 295 

gga aat gtc atg ttg tgt cat gaa atg cat ggt tgg ttt atg gaa aca 1026 
Gly Asn Val Met Leu Cys His Glu Met His Gly Trp Phe Met Glu Thr 
300 305 310 

ttt ata tct tgt ate aac tagttgattt gtatctcttg tgtcaaaaaa 1074 
Phe He Ser Cys He Asn 
315 

aaaaaaaaaa aaa 1087 

<210> 18 

<211> 318 

<212> PRT 

<213> Zea mays 

<220> 

<221> VARIANT 

<222> (1) . . . (318) 

<223> Xaa = Any Amino Acid 

<400> 18 

Met Met Pro Leu Ser Gin Thr Asp Phe Ser Pro Ser Gin Phe Thr Ser 

15 10 15 

Ser Gin Asn Ala Ala Ala Asp Ser Thr Thr Pro Ser Lys Met Arg Gly 

20 25 30 

Ala Ser Ser Thr Met Pro Leu Thr Val Lys Xaa Val Val Asp Ala Gin 

35 40 45 

Gin Ser Gly Thr Gly Glu Lys Gly Ala Pro Phe He Val Asn Gly Val 

50 ' 55 60 

Glu Met Ala Asn lie Arg Leu Val Gly Met Val Asn Ala Lys Val Glu 
65 70 75 80 

Arg Thr Thr Asp Val Thr Phe Thr Leu Asp Asp Gly Thr Gly Arg Leu 

85 90 95 

Asp Phe He Arg Trp Val Asn Asp Ala Ser Asp Ser Phe Glu Thr Ala 

100 105 110 

Ala He Gin Asn Gly Met Tyr lie Ala Val He Gly Ser Leu Lys Gly 

115 120 125 

Leu Gin Glu Arg Lys Arg Ala Thr Ala Phe Ser He Arg Pro He Thr 

130 135 140 

Asp Phe Asn Glu Val Thr Leu His Phe He Gin Cys Val Arg Met His 
145 150 155 160 

He Glu Asn Thr Glu Leu Lys Ala Gly Ser Pro Ala Arg lie Asn Ser 
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165 170 175 

Ser Met Gly Val Ser Phe Ser Asn Gly Phe Ser Glu Ser Ser Thr Pro 

180 185 190 

Thr Ser Leu Lys Ser Ser Pro Ala Pro Val Thr Ser Gly Ser Ser Asd 

195 200 205 

Thr Asp Leu His Thr Gin Val Leu Asn Phe Phe Asn Glu Pro Ala Asn 

210 215 220 

Leu Glu Ser Glu His Gly Val His Val Asp Glu Val Leu Lys Arg Phe 
225 230 235 240 

Asn Phe Cys Arg Arg Ser Arg Ser Arg Met Leu Leu lie Thr lie Tro 

245 250 255 

Thr Arg Gly Val Phe Thr Gin Gin Leu Met Asn Ser Thr Thr Arg Gin 

260 265 270 

Leu Asn Arg Phe Glu Gly Gin Pro Ala Gly Asn Gly Arg Gly Leu Ser 

275 280 " 285 

lie Thr Cys Thr Lys Pro Lys Ser Gly Asn Val Met Leu Cys His Glu 

290 295 " 300 

Met His Gly Trp Phe Met Glu Thr Phe He Ser Cys He Asn 
305 310 315 

<210> 19 

<211> 1074 

<212> DNA 

<213> Zea mays 

<220> 

<221> misc_f eature 
<222> (0) ... (0) 

<223> Maize RPA Middle Subunit Homologue-6 
<221> CDS 

<222> (55) . . . (873) 
<400> 19 

gacccacgcg tccgcgcaag cctattcgcc gcacctcctc aggtgaccgg gaag atg 57 

Met 
1 

atg ccg ttg age caa acc gac ttc teg ccg teg cag ttc ace tec tec 105 
Met Pro Leu Ser Gin Thr Asp Phe Ser Pro Ser Gin Phe Thr Ser Ser 
5 10 15 

cag aat gec gec gee gac tec acc acg cct tec aag atg cgc ggc gcg 153 
Gin Asn Ala Ala Ala Asp Ser Thr Thr Pro Ser Lys Met Arg Gly Ala 
20 25 30 

tec age acc atg ccg etc acc gtg aag cag gtc gtc gac gcg cag cag 201 
Ser Ser Thr Met Pro Leu Thr Val Lys Gin Val Val Asp Ala Gin Gin 
35 40 45 

tct ggc acg ggc gag aag ggc get ccg ttc ate gtc aat ggc gtc gag 249 
Ser Gly Thr Gly Glu Lys Gly Ala Pro Phe He Val Asn Gly Val Glu 
50 55 60 f 65 

atg get aac att cga ctt gtg ggg atg gtc aat gee aag gtg gag egg 297 
Met Ala Asn He Arg Leu Val Gly Met Val Asn Ala Lys Val Glu Arg 
70 75 80 
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acg acc gat gtg acc ttc acg etc gac gat ggc acc ggc cgc etc gat 
Thr Thr Asp Val Thr Phe Thr Leu Asp Asp Gly Thr Gly Arg Leu Asp 
85 90 95 



345 



ttc ate aga tgg gtg aat gat get tea gat tct ttt gaa act get get 
Phe He Arg Trp Val Asn Asp Ala Ser Asp Ser Phe Glu Thr Ala Ala 
100 105 110 



393 



att cag aat ggt atg tac att gcg gtc att gga age etc aag gga ctg 
He Gin Asn Gly Met Tyr He Ala Val He Gly Ser Leu Lys Gly Leu 
115 120 125 



441 



caa gag agg aag cgt get act get ttc tea ate agg cct ata acc gat 
Gin Glu Arg Lys Arg Ala Thr Ala Phe Ser He Arg Pro He Thr Asp 
130 135 140 145 



489 



ttc aat gag gtt acg ctg cat ttc att cag tgt gtt egg atg cat ata 
Phe Asn Glu Val Thr Leu His Phe He Gin Cys Val Arg Met His He 
150 155 160 



537 



gag aac act gaa tta aag get ggc agt cct gca cga ate aat tct tct 
Glu Asn Thr Glu Leu Lys Ala Gly Ser Pro Ala Arg He Asn Ser Ser 
165 170 175 



585 



atg gga gtg tea ttc tea aat gga ttc agt gaa tea age aca ccg aca 
Met Gly Val Ser Phe Ser Asn Gly Phe Ser Glu Ser Ser Thr Pro Thr 
180 185 190 



633 



tct ttg aaa tec agt ccc gca ccg gtg acc age ggg tea tec gat act 
Ser Leu Lys Ser Ser Pro Ala Pro Val Thr Ser Gly Ser Ser Asp Thr 
195 ^ 200 205 



681 



gat ctg cac acg cag gtc ctg aat ttt ttt aat gaa cca gcg aac etc 
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aaaaggaaaa aaaaaaaaaa a 1074 
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tcccgggtcg acccacgcgt ccgcgatcct cccatctgcg cacccgcaag cctattcgcc 60 
gcacctcctc aggtgaccgg gaag atg atg ccg ttg age caa acc gac ttc 111 

Met Met Pro Leu Ser Gin Thr Asp Phe 
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teg ccg teg cag tec acc tec tec cag aat gec gee gee gac tec ace 159 
Ser Pro Ser Gin Phe Thr Ser Ser Gin Asn Ala Ala Ala Asp Ser Thr 
10 15 20 25 

acg cct tec aag atg cgc ggc gcg tec age acc atg ccg etc acc gtg 207 
Thr Pro Ser Lys Met Arg Gly Ala Ser Ser Thr Met Pro Leu Thr Val 
30 35 40 

aag cag gtc gtc gac gcg cag cag tct ggc acg ggc gag aag ggc get 255 
Lys Gin Val Val Asp Ala Gin Gin Ser Gly Thr Gly Glu Lys Gly Ala 
45 50 55 

ccg ttc ate gtc aat ggc gtc gag atg get aac att cga ctt gtg ggg 303 
Pro Phe lie Val Asn Gly Val Glu Met Ala Asn lie Arg Leu Val Gly 
60 65 70 

atg gtc aat gec aag gtg gag egg acg acc gat gtg acc ttc acg etc 351 
Met Val Asn Ala Lys Val Glu Arg Thr Thr Asp Val Thr Phe Thr Leu 
75 80 85 

gac gat ggc acc ggc cgc etc gat ttc ate aga tgg gtg aat gat get 399 
Asp Asp Gly Thr Gly Arg Leu Asp Phe lie Arg Trp Val Asn Asp Ala 
90 95 100 105 

tea gat tct ttt gaa act get get att cag aat ggt atg tac att gcg 447 
Ser Asp Ser Phe Glu Thr Ala Ala lie Gin Asn Gly Met Tyr lie Ala 
110 115 ' 120 

gtc att gga age etc aag gga ctg caa gag agg aag cgt get act get 495 
Val lie Gly Ser Leu Lys Gly Leu Gin Glu Arg Lys Arg Ala Thr Ala 
125 130 135 

ttc tea ate agg cct ata acc gat ttc aat gag gtt acg ctg cat ttc 543 
Phe Ser lie Arg Pro lie Thr Asp Phe Asn Glu Val Thr Leu His Phe 
140 145 150 

att cag tgt. gtt egg atg cat ata gag aac act gaa tta aag get ggc 591 
lie Gin Cys Val Arg Met His He Glu Asn Thr Glu Leu Lys Ala Gly 
155 160 165 

agt cct gca cga ate aat tct tct atg gga gtg tea ttc tea aat gga 639 
Ser Pro Ala Arg He Asn Ser Ser Met Gly Val Ser Phe Ser Asn Gly 
170 175 180 185 

ttc agt gaa tea age aca ccg aca tct ttg aaa tec agt ccc gca ccg 687 
Phe Ser Glu Ser Ser Thr Pro Thr Ser Leu Lys Ser Ser Pro Ala Pro 
190 195 200 

gtg acc age ggg tea tec gat act gat ctg cac acg cag gtc ctg aat 735 
Val Thr Ser Gly Ser Ser Asp Thr Asp Leu His Thr Gin Val Leu Asn 
205 210 215 

ttt ttt aat gaa cca gcg aac etc gag agt gag cat ggg gtg cac gtt 783 
Phe Phe Asn Glu Pro Ala Asn Leu Glu Ser Glu His Gly Val His Val 
220 225 230 
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gat gaa gta etc aag egg ttc aaa ctt ttg ccg aag aag cag ate acg 
Asp Glu Val Leu Lys Arg Phe Lys Leu Leu Pro Lys Lys Gin lie Thr 
235 240 245 



831 



gat get att gat tac aat atg gac teg ggg cgt ctt tac tea aca att 
Asp Ala lie Asp Tyr Asa Met Asp Ser Gly Arg Leu Tyr Ser Thr He 
250 255 260 2 65 
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gat gaa ttc cac tac aag gca act taaccgattt gaaggtcagc ctgctggaaa 
Asp, Glu Phe His Tyr Lys Ala Thr 
270 
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tggcagagga ctaagtatca cttgtactaa accaaagtct ggaaatgtca tgttgtgtca 993 
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tgtagattgg ctgatagctg attegggtag ctggtccaat tgcaatctgg ggcccaataa 1173 

ttcagatgca aaagcagaaa gatatttcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1231 
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